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MOLECULES 

The present invention describes the chromosomal location of one or more genes involved 
in obesity. 

In recent years, there has been an increase in the population of patients with what are. 
termed "life-style related" diseases, such as obesity and hyperlipidemia. Obesity refers to a 
physical state in which the stored fat, constituted mainly of triglycerides, is accumulated 
excessively in the body. A consequence of obesity is an increased risk of arteriosclerosis, 
fatty liver, hypertension, diabetes, coronary heart disease, stroke, gallbladder disease, 
osteoarthritis, respiratory problems and some types of cancer (Visscher & Seidell, Annu Rev 
Public Health (2001); 22(1): 355-75). It has been shown recently that there can also exist a 
strong genetic component underlying the development of obesity. The role of certain genes, 
e.g., agouti, leptin {ob) and leptin receptor (db), in the regulation of body fat distribution has 
been described (Friedman & Halaas, Nature (Oct 1998); 395(6704): 763-70; Ahima et al, 
Front Neuroendocrinol (Jul 2000); 21(3): 263-307). In humans, mutations in leptin, leptin 
receptor, melanocortin 4- receptor (MC4-R) and peroxisome proliferator-activated receptor 

(PPAR) y2 genes have been shown to be important component in patients with severe obesity 

(Clement et. al; Lit J Obes Relat Metab Disord 2000 Mar ;24(3): 391-3, Yeo et. al; Nat 

Genet 1998 Oct;20(2): 111-2). 

The identification of other genes involved in the development of obesity will provide a 

better understanding of the disease, and an opportunity to develop more specific and effective 

drugs. 

The present invention is based on the discovery of a chromosomal translocation which 
has a predisposing effect on the development of obesity. The translocation exists between 
chromosome 4 and chromosome 15. Specifically, chromosome 4 has been broken at 
cytoband 4q22.3 and chromosome 15 has been broken at cytoband 15q22.2. The outcome of 
the translocation event is that at least two genes are affected in persons having this 
translocation and that persons having this translocation are more susceptible to develop 
obesity. 

Two of the genes affected by the translocation are uncoordinated 5C (UNC5C) 
residing on chromosome 4 and RAR-like orphan receptor alpha 1 (RORal) residing on 
chromosome 15. 
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The breakpoint at chromosome 4 occurs between the genes pyruvate dehydrogenase 
A2 (PDHA2) and UNC5C. While neither of these genes are destroyed in the mutation 
event, UNC5C transcription is upregulated in adipocytes. A cDNA encoding UNC5C is set 
out as SEQ ID NO: 9 (with the first nucleotide of the UNC5C coding region accorded 
position 155), and its gene product is set out as SEQ ID NO: 10. 

There is an alternative splice version produced from the UNC5C gene, where the 
exons 11 and 12 are spliced together from internal sites producing a transcript with a 
truncated open reading frame and a much shorter protein. A sequence containing this splice 
event has been detemined and submitted to the GENESEQN d'atabase and has accession 
number AAC90914 (Fig. 2 and Fig.10). We confirmed the presence of this alternative 
transcript with the use of PCR on human brain material. A putative "in silico" generated full 
length clone of the alternative transcript is set out as SEQ ID NO: 13 (with the first nucleotide 
of the sUNC5C coding region accorded position 1) and its gene product is set out as SEQ ID 
NO 14. As used herein "sUNC5C" refers to this shorter splice version of UNC5C unless 
otherwise stated. The sequence AAC90914 is a truncated version of sUNC5C. 

The breakpoint at chromosome 15 is within the first intron of RORal and this 
mutation event destroys the transcription of the RORal isoform. A cDNA encoding RORal 
is set out as SEQ ID NO: 5 (with the first nucleotide of the RORal coding region accorded 
position 102), and its gene product is set out as SEQ ID NO:6. The translocation. results in the 
first exon of RORal being spliced into the second exon of UNC5C, resulting in the 
expression of two novel transcripts which are fusions between RORal andUNC5C (long and 
short). These two novel transcripts are expressed in affected patients adipocytes. The first 
657 amino acids are identical for both fusion proteins: 

• The first transcript results in the translation of a long fusion protein between RORal - 
UNC5C (termed herein "lRORal-UNC5C"). A cDNA encoding lRORal-UNC5C is set 
out as SEQ ID NO: 1, and its protein fusion product as SEQ ID NO:2. In SEQ ID NO: 1, 
nucleotide positions 1-233 represent that part of the transcript from RORal . Nucleotide 
positions 234-2986 represent that part of the transcript from UNC5C. In SEQ ID NO: 2, 
amino acid positions 1-55 represent that part of the protein fusion product derived from 
RORal DNA. Amino acid positions 56-383 and 403-964 represent that part of the protein 
fusion product derived from UNC5C DNA. The lRORal-UNC5C transcript is produced by 
splicing RORal exonl with UNC5C exon 2-16 together with a fusion specific exon situated 
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between UNC5C exons 7 and 8. The fusion specific exon is an extra exon of 57bp which 
encodesfor an extra 19 amino acid residues and resides within theIRORal-UNC5C 
transcript at nucleotide positions 1220-1276 of SEQ ID NO: 1. In SEQ ID NO:2, amino acid 
positions 384-402 represent that part of the protein fusion product derived from the fusion 
specific exon. It is to be noted that the first coding nucleotide for the amino acid position .384 
comes from UNC5C exon 7. 

The second transcript results in the translation of a shorter fusion protein 
between RORocl-UNC5C (known herein as the "sRORal-UNC5C"). A cDNA encoding 
sRORal-UNC5C is set out as SEQ ID NO: 3, and its protein fusion product as SEQ ID NO: - 
4. In SEQ ID NO: 3, nucleotide positions 1-233 represent that part of the transcript from 
RORal. Nucleotide positions 234-2546 represent that part of the transcript from TJNC5C. In 
SEQ ID NO:4, amino acid positions 1-55 represent that part of the protein fusion product 
derived from RORal DNA. ' Amino acid positions 56-383 and 403-669 represent that part of 
the protein fusion product derived from sUNC5C DNA. As for the long form, a fusion 
specific exon is situated between UNC5C exons 7 and 8. The fusion specific exon is an extra 
exon of 57bp which encodes for an extra 19 amino acid residues and resides within the 
sRORol-UNC5C transcript at nucleotide positions 1220-1276 of SEQ ID NO:3. In SEQ ID 
NO:4, amino acid positions 384-402 represent that part of the protein fusion product derived 
from the fusion specific exon. It is to be noted that the first coding nucleotide for the amino 
acid position 384 comes from UNC5C exon 7. The sRORcd-UNC5C transcript only differs 
from lRORcd-UNC5C in the way that UNC5C exons 11 and 12 are spliced together. 
sRORccl-UNC5C uses internal splice sites within UNC5C exon 11 and 12 so that the 3' part 
of exon 11 and the 5' part of exon 12 is omitted (Figs. 4, 6 and 7). Thus, sRORal-UNC5C 
has a destroyed open reading frame resulting in the deletion of 307 amino acids and replacing 
them with 12 amino acids from another reading frame. 

As used herein "RORal -UNC5C" refers to both the short and large fusion protein, 

unless otherwise stated. 

The genomic DNA sequence at the fusion site between chromosome 4 (nucleotide 
positions 1-3221) and chromosome 15 (nucleotide positions 3229-6277) isolated from the 
translocation 4:15 is set out as SEQ ID NO: 11. Nucleotide positions 3222-3228 of SEQ ID 
NO: 11 are a 7bp insertion created in the translocation event. The localisation of this sequence 
is shown in Fig. 4. 
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Thegenomic DNA sequence at the fusion site between chromosome 15 (nucleotide 
positions 1-998) and chromosome 4 (nucleotide positions 1036-1829) isolated from the 
translocation 15:4 is set out as SEQ ID NO:12. Nucleotide positions 999-1035 of SEQ ID 
NO:12 are a 36bp insertion created in the translocation event. The localisation of this 

sequence is shown in Fig. 5: 

A cDNA encoding a novel isoform of RORa, designated "RORcc5", is set out as SEQ 
ID NO: 7 (with the first nucleotide of the RORa5 coding region accorded position 188), and 
its gene product is set out at SEQ ID NO: 8. 

According to one aspect of the present invention there is provided a polypeptide 

selected from the group consisting of: 

i) a fusion peptide of RORccl - UNC5C polypeptide comprising SEQ ID NO:2 
or a fragment thereof of at least 10 amino acids wherein the fragment 
comprises either the junction of the fusion protein, the flanking regions of 
the fusion specific exon or the fusion specific exon of SEQ ID NO:2; 

ii) a fusion polypepide of RORotl - UNC5C polypeptide comprising SEQ ID 
NO:4 or a fragment thereof of at least 10 amino acids wherein the fragment 
comprises either the junction of the fusion protein, the flanking regions of 
the fusion specific exon or the fusion specific exon of SEQ ID:4. 

iii) apolypeptide of RORcc5 polypeptide comprising the amino acid sequence of 
SEQ ID NO: 8. or a fragment thereof of at least 10 amino acids 

or an animo acid sequence at least 90%, preferably 95%, 98% or 99% identical to 
either i) or ii) or iii) or a complement thereof. 

In a preferred embodiment, a conservative analogue of the protein is contemplated. 

In this specification conservative amino acid analogues of specific amino acid 
sequences are contemplated which retain the relevant biological properties of a component of 
the invention but differ in sequence by one or more conservative amino acid substitutions, 
deletions or additions. However the specifically listed amino acid sequences are preferred. 
Typical conservative amino acid substitutions are tabulated below. 
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CONSERVATTVE SUBSTITUTIONS 



Original 


Exemplary Substitutions 


Preferred Substitutions 


Ala (A) 


Val; Leu; lie 


Val . 


Arg(R) 


Lvs;Gln;Asn ' - 


Lys 


Asn (N) 


Gin; His; Lys; Arg 


Gin 


Asp(D) 


niu 


Glu 


Cvs(C) 


Ser . . ■— 


Ser . 


Gin (Q) 


Asn 


Asn 


Glu (E) 


_A_E ■ 


Asp 


Glv(G) 


Pro 


Pro 


His (H) 


Asn; Gin; Lys; Arg 


Arg 


fled) 


Leu; Val; Met; Ala; Phe; Norleucine 


Leu 


Leu (L) 


Norleucine; lie; Val; Met; Ala; Phe 


He 


Lvs(K) 


Are; Gin; Asn 


Arg 


Met(M) 


Leu; Phe; He 


Leu 


Phe(F) 


Leu; Val; lie; Ala 


T ' mi 


Pro(P) 


Gly 


Gly 


Ser (S) 


Thr 


Thr 


Thr(T) 


Ser 


Ser 


Trp(W) 


Tyr 


Tvr 


Tyr(Y). 


Trp ; Phe; Thr; Ser 


Phe 


Val(V) 


lie; Leu; Met; Phe; Ala; Norleucine 


Leu 



In this specification nucleic acid variations (deletions, substitutions and additions) of 
5 specific nucleic acid sequences are contemplated which retain which the ability to hybridise 
under stringent conditions to the specific sequence in question. Stringent conditions are 
defined as 6xSSC, 0.1% SDS at 60° C for 5 minutes. However specifically listed nucleic acid 
sequences are preferred. It is contemplated that peptide nucleic acid may be an acceptable 
equivalent of polynucleotide sequences, at least for purposes that do not require translation 
10 into protein (Wittung (1994) Nature 368, 561). 
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By "the junction of the fusion protein", we mean that the fragment of at least 10 
amino acids includes the amino acid accorded position 55 and 56 of SEQ ID NO:2 or SEQ 
IDNO:4. 

By "the flanking regions of the fusion specific exon" we mean that the fragment of at 
least 10 ammo acids includes either the amino acids accorded position 383 and 384 or the 
amino acids accorded position 403 and 404 of SEQ ID NO:2 or SEQ ID NO:4 

The invention further features an isolated nucleic acid molecule capable of encoding 
the RORal-UNC5C polypeptide or a fragment thereof or the RORccS polypeptide or a 
fragment thereof as defined aboveor a complement thereof, or a nucleic acid molecule having 
at least 65% preferably 75%, 85%, 90%, 95%, 98% or 99%, identity thereto. Preferred 
nucleotide sequences are SEQ ID NO:l, 3 or 7. 

The invention further features a vector comprising the nucleic acid molecule as 

defined above. 

Also included within the invention is the vector as defined above, wherein said vector 

is an expression vector. 

The invention further features a host cell comprising the expression vector as defined 

above. 

The invention further features a method of making the RORocl-UNC5C polypeptide or 
the RORo5 polypeptide as defined above comprising: incubating the host cell as defined 
hereinabove in a medium conducive to expression of the polypeptide and optionally purifying 
the polypeptide. 

The invention further features an antibody specific for the RORccl-UNC5C 
polypeptide or the RORoc5 polypeptide as defined above. 

The invention further features a method for detecting the presence of the RORocl- 
UNC5C polypeptide or the RORa5 polypeptide as defined above in a biological sample 
comprising: 

contacting the sample with a binding agent capable of specifically binding to the 
polypeptide, and measuring the amount of binding. 

The invention further features a method of detecting the presence of the isolated 
nucleic acid molecule as defined above comprising: 

contacting said nucleic acid with a nucleic acid capable of specifically hybridizing to 
the translocation breakpoint. 
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Regions of specificity in the fusion protein/gene include, for example, the junction 
point and fusion specific insert. 

In yet another aspect, the invention features an isolated nucleic acid molecule 
comprising a nucleotide sequence having at least 65% identity to a degenerate variant of SEQ 
ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7. The invention further features an isolated 
nucleic acid molecule comprising the complement of SEQ ID NO: 1, SEQID NO: 3 or SEQ 
ID NO: 7. 

. The invention further features an isolated nucleic acid encoding a RORcd-UNC5C 
polypeptide, the isolated nucleic acid molecule comprising a nucleotide sequence having at 
least 65% identity to a degenerate variant of SEQ ID NO: 1 or SEQ ID NO: 3. 

The invention further features an isolated nucleic acid encoding aRORa5 polypeptide, 
the isolated nucleic acid molecule comprising a nucleotide sequence having at least 65% 
identity to a degenerate variant of SEQ ID NO: 7. 

The invention further features a vector comprising the nucleic acid molecule of any of 
the above and a host cell comprising the vector. 

Also included within the invention is an isolated nucleic acid molecule encoding a 
protein having 65% sequence identity to the amino acid sequence of SED ID NO:2, SEQ ID 
NO: 4 or SEQ ID NO: 9. Moreover the invention features a purified polypeptide comprising 
an amino acid sequence having at least 65% identity to the amino acid sequence of SEQ ID. 
NO: 2, SEQ ID NO: 4 or SEQ ID NO: 9. 

Also within the invention is a method for producing a protein comprising culturing the 
host cell having SEQ ID NO:l, SEQ ID NO: 3 or SEQ ID NO:7 under conditions whereby 
said protein is produced, and recovering the protein from the host cell culture. 

The invention further includes a method for detecting a polynucleotide which encodes 
a protein comprising the amino acid sequence of SEQ ID NO:2, SEQ ID NO: 4 or SEQ ID 
NO: 8 in a biological sample comprising the steps of: 

a) hybridizing the complement of the polynucleotide sequence which encodes SEQ ID 
NO: 2, SEQ ID NO: 4 or SEQ ID NO:. 8 to a nucleic acid material of a biological sample, 
thereby forming a hybridization complex; and 

b) detecting the hybridization complex, wherein the presence of the complex correlates 
with the presence of a polynucleotide encoding a protein in the biological sample. 

One aspect of the invention is directed to a method for detecting the presence of other, 
obesity suscetibility genes which may be responsible for the development of obesity by 
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determining the identity of genes in and around the resulting translocation junction t(4:15)or 
t(15:4). The method includes determining the identity of genes in and around the 
translocation junction, e.g., located 20 cM, or less, adjacent to the junction and determining if 
the gene shows aberrant expression compared to the gene in a person not having the 
chromosomal translocation. 

As used herein, an "obesity susceptibility gene" or "obesity gene" is a gene which as a 
consequence of the translocation event between chromosome 4 and chromosome 15 has a 
predisposing influence in the development of obesity in a subject. For example, an obesity 
susceptibility gene can refer to any or all of the following genes: UNC5C, sIINC5C , RORal , 
RORa5, lRORal-UNC5C and sRORal-UNC5C. 

As used herein "obesity protein" or "obesity susceptibility polypeptide" is the gene 
product of the obesity susceptibility gene or obesity gene. 

In yet another aspect, the invention features a method for detecting the chromosomal 
translocation comprising analysing a sample of DNA from an individual for the presence of 
the translocation breakpoint, wherein the presence of the translocation breakpoint indicates 
that the person has a susceptibility to developing obesity. The nucleic acid sequence of the 
translocationjunctionst(4:15)andt(15.:4)is set out in SEQ ID NO: 11 andSEQIDNO: 12 
respectively. The localisation of these sequences are shown in Figs. 4 and 5 respectively. 

The invention further includes a method for identifying a test compound that 
modulates the expression of an obesity susceptibility gene such as UNC5C, sUNC5C, 
• RORal, RORa5, lRORal-UNC5C and sRORal-UNC5C identified in the methods above, 
comprising contacting a cell capable of expressing an obesity susceptibility gene with a test 
compound; and determining the level of expression of the obesity susceptibility gene in the 
presence of the test compound, wherein a decrease or an increase in the obesity susceptibility 
gene expression, as compared to the level of expression of the obesity susceptibility gene in 
the absence of the compound, is indicative that the test compound modulates the expression of 
the obesity susceptibility gene. 

Also within the invention is a method of identifying a test compound that modulates 
the activity of a protein encoded by an obesity susceptibility gene such as UNC5C, sUNCSC, 
RORal, RORa5, lRORal-UNC5C and sRORal-UNC5C, comprising contacting the protein 
with a test compound and determining the level of activity of the obesity protein in the 
presence of the compound, wherein a decrease or an increase in protein activity is indicative 
that the test compound modulates protein activity. 
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The invention further features a method of treating a subject having obesity 
comprising administering an effective amount of the compound identified above. The 
invention further includes a pharmaceutical composition comprising the compound identified 
as above, and a pharmaceutically acceptable adjuvant, diluent or carrier. 

The invention further includes a method of making a pharmaceutical composition, hi 
one aspect, the method includes: contacting a cell capable of expressing an obesity 
susceptibility gene with a test compound; determining the level of expression of the obesity 
susceptibility gene identified above in the presence of the test compound, wherein a decrease 
or an increase in expression of the obesity susceptibility gene, as compared to the level of 
expression of the obesity susceptibility gene in the absence of the compound, is indicative that 
the test compound modulates the obesity susceptibility gene expression; and formulating the 
test compound that modulates the obesity susceptibility gene expression into a pharmaceutical 
composition. 

In another aspect, the invention includes making a pharmaceutical composition 
including contacting an obesity susceptibility protein with a test compound; determining the 
level of activity of the obesity susceptibility protein in the presence of the compound, wherein 
a decrease in the presence of the obesity susceptibility protein, as compared to the level of 
activity of the obesity susceptibility protein in the absence of the compound, is indicative that 
the test compound modulates obesity susceptility protein activity; and formulating the test 
compound that decreases obesity susceptibility protein activity into a pharmaceutical 
composition. . 

In another aspect, the invention includes a method for determining if an obesity 
susceptibility gene identified above has an altered level of gene expression comprising 
comparing the level of obesity gene expression in a cell from a patient having obesity with a 
control cell, and detemining the level of expression of the obesity susceptibility gene in both 
cells, wherein a decrease or an increase in expression of the obesity susceptibility gene, as 
compared to the level of expression of the obesity susceptibility gene in the control cell, 
indicates that the obesity susceptibility gene has altered gene expression. 

The invention further includes methods of diagnosing obesity or a susceptibility 
thereto in a subject. The method includes determining the level of an obesity susceptibility 
gene mRNA and/or the level of an obesity causing protein in a sample from a subject; and 
comparing the level of the mRNA and/or protein in the sample with a control, wherein a 
decrease or an increase in the level of the mRNA and/or protein in the sample compared to the 
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control indicates that the subject has obesity, or a susceptibility thereto. The invention also 
extends to products useful for carrying out the assay, such as DNA probes (labelled or 

unlabelled), kits and the like. 

the invention further includes methods of diagnosing obesity or a susceptibility 
thereto in a subject The method includes analysing for the presence of RORdl-UNC5C 
mRN A and/or protein in a sample from a subject; wherein the presence of the RORal- 
UNC5C mRNA and/or protein indicates that the subject has obesity, or a susceptibility 
thereto. The invention also extends to products useful for carrying out the assay, such as 
DNA probes (labelled or unlabelled), kits and the like. 

' The present invention also includes gene-based therapies directed at obesity. 
Therapies may be in the form of polynucleotides comprising all or a portion of the obesity 
susceptibility gene(s), or obesity gene locus, placed in appropriate vectors or delivered to 
target cells in direct ways which would modify the function of the obesity protein. 

General molecular biology techniques are described in "Current Protocols in Molecular 
Biology Volumesl-3, edited by F.M Asubel, R Brent and R E Kingston; published by John 
Wiley, 1998 and Sambrook, J. and Russell, D.W., Molecular Cloning: A Laboratory Manual, 
the third edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 2001. 

Fig 1 shows a pedigree of the family with schematically drawn chromosomes showing 
the inheritance of the balanced translocation where all affected individuals are extremely . 

obese. BMI = body mass index. 

Fig 2 shows the normal gene map of human chromosome 4 at the translocation 

breakpoint. 

■ Fig 3 shows the normal map of chromosome 15 at the translocation breakpoint with 
the RORa locus. 

Fig 4 shows the outcome of the translocation event at chromosome 4 (translocation 
chromosome 4:15) with the resulting fusion transcripts. 

Fig5 shows the outcome of the translocation event at chromosome 15 (translocation 
chromosome 15:4) With the destruction of RQRocl . 

Fig 6 shows DNA sequence alignment of the IRORal-XJNCSC (SEQ ID NO: 1) and 
) sRORal-UNC5C (SEQ ID NO: 3) fusion transcripts together with TJNC5C(AF055634) and 
the alternative spliceform of UNC5C (AAC90914). Also shown are the genomic exon 
sequences of RORal exon 1 and UNC5C exon 1-16. 
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Fig 7 shows a protein alignment of the lRORal-UNC5C (SEQ ID NO: 1) and 
sRORal-UNC5C (SEQ ID NO: 3) fusion proteins with UNC5C. 

Fig 8 shows a schematic view of the protein domain architecture of the lRORal- 
TJNC5C and sRORccl-UNC5C fusion protein transcripts together with that of normal human 
UNC5C. 

Fig. 9 shows Taqman™ analysis of RORa5 expression in various tissues, with the 

highest expression being found in brain. 

Fig. 10' shows a schematic view of the lRORocl-UNC5C and sRORcd-UNC5C fusion 
transcripts together with UNC5C and sUNC5C. 

The invention is based on the finding that a genetic alteration in patients with obesity 
is the translocation of two specific chromosomes (t(4;15)). This event leads to the production 
of two fusion proteins that is thought to promote the development of obesity. It also destroys 
one copy of RORcd (SEQ ID NOS: 5 & 6) thus maybe reducing the expression of this gene 
in certain tissues during development. It may also affect the expression of the other isoforms 
of RORa, in particular RORa5 (SEQ ID NOS: 7 & 8) which is the closest situated isoform of 
RORa on the other side of the breakpoint at chromosome 15. 

The breakpoint in chromosome 4 occurs at cytoband 4q22.3 and the breakpoint in 
chromosome 15 occurs at cytoband 15q22.2. The balanced translocation results in patients 
having one normal chromosome 4 and one normal chromosome 15 together with the 
reciprocal translocation chromosomes. The outcome of the translocation event is that the 
expression and\or activity of at least one or more genes is affected. 

The breakpoint at chromosome 4 is between the genes Pyruvate dehydrogense A2 
(PDHA2) and UNCSC (SEQ ID NOS: 9 & 10). While neither of these genes are destroyed in 
the mutation event, UNC5C was found to be upregulated in adipocytes from obese, affected 
individuals, as compared with control adipocytes. 

UNC5 is required for correct projections of pioneer axons and for migrating cells 
along the body wall in C. elegans (Leung Hagensteijn et. al., Cell 1992 Oct 71 (2): 289-99). 
UNC5 genetically interacts with UNC6 andUNC40. Mutants of all three genes defects in the 
axon projections. UNC5 and UNC40 are transmembrane receptors expressed on developing 
neurons. Both are receptors for the diffusible protein UNC6/Netrin (Hedgecock et. al.; Neuron 
1990 Jan ;4(1): 61-85) (Wadsworth; Trends Neurosci 2002 Aug ;25(8): 423-9) Vertebrates 
have at least 3 UNC5 like genes named UNC5A, UNC5B and UNC5C (Leonardo et. al.; 
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Nature 1997 Apr ;386 (6627): 833-8) and the protein contains two inmunoglobulin 
domains, two trombospondin domains on the extracellular part and intracellular domains 
ZU5, DB and a Death Domain (Leung Hagensteijn et.al., Cell 1992 Oct 71 (2): 289-99) 
(Keleman & Dickson; Neuron 2001 Nov ;32(4): 605117). Human UNC5C is localised on 
chromosome 4 at cytoband q22:3 (Ackerman & Knowles; Genomics 1998 Sep ;52(2): 205-8). 

The breakpoint at chromosome 15 is within the first intron of RORccl . This isoform of 
RORa is destroyed by the mutation event. Moreover, it was found that the translocation 
resulted in a novel transcript, which is a fusion between RORal exon 1 with UNC5C exon 2 
producing 2 novel fusion proteins through differential splicing within the UNC5C gene (SEQ 
IDNO:2 and SEQ ID NO: 4). 

RORal is an orphan nuclear hormone receptor localized on chromosome 15 q22.3 (V. 
Giguere et. al. Genomics 1995 Aug ;28(3): 596-8). Nuclear hormone receptors (NHR) 
constitute a super family of highly conserved intracellular receptors found in nematodes, flies 
and vertebrates. The protein architecture is a N-terminal transactivation domain followed by a 
DNA binding domain, hinge region and a c-terminal ligand binding domain. Many NHRs 
form dimers upon activation and translocate into the nucleus where they bind DNA to activate 
or repress target gene expression. There are four publicly available isoforms of RORal, 
namely RORal-4. 

We have identified yet another isoform of RORa, designated RORo5 (SEQ ID 7 & 8). 
RORa5 was isolated with 5'RACE (rapid amplification of C-terminal ends) performed on 
public available adipocyte cDNA library (clontech). The expression pattern of RORo5 is 
similar to RORal, and is most abundantly expressed in the brain (Fig. 9). 

Accordingly, the present invention describes the chromosomal location of obesity 
susceptibility genes and provides methods for detecting the presence of such genes. The 
present invention enables the development of novel therapies for obesity by screening for 
compounds and other entities, such as antibodies, which modulate the activity of the proteins 
encoded by the associated genes, e.g., UNC5C, sUNCSC, RORal, RORaS, lRORal-UNC5C 
and sRORal-UNC5C. Knowledge of the sequence of the obesity susceptibility genes also 
enables the development of novel antigene methods to modulate the expression of the obesity 
susceptibility gene and also enables the development of novel gene therapy techniques to treat 
obesity. The discovery of obesity susceptibility genes may also assist in developing novel 
methods for diagnosing obesity via (i) measuring the levels of the translated mRNA present in 
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affected tissue and (ii) measuring the levels of the protein in affected tissue. It is possible 
that the diagnosis of obesity, or the susceptibility of an individual to obesity, by these 
methods may be achieved in patients who do not yet display the classical symptoms of the 
disease. Such determination of susceptibility to obesity or the early detection of disease 
development may lead to earlier clinical intervention, than is currently possible, and may lead 
to more effective treatment of the disease. 
Further identification of obesity susceptibility genes 

The present findings describes a chromosomal translocation which alters the 
expression\activity of genes in and around the translocation junction causing obesity. The 
genetic location of such genes are at the breakpoint of a translocation between chromosome 
4q22.3 and chromosome 15q22.2. Thus, the present invention includes methods of 
identifying genes around the translocation breakpoint and determining if any gene around the 
translocation breakpoint has abherrant expression. The invention includes identifying genes, 
e.g., 8, 7, 6, 5, 4, 3, 2 or 1 cM around the translocation breakpoint and comparing the 
expression profile of these genes against the control genes. This can be done by using 
techniques known in the art to further analyze and delineate this genetic region. For example, 
using adipocytes isolated from obese individuals carrying the translocation and control 
adipocytes it is possible to compare the expression profile of an identified obese susceptibility 
gene identified in the regi6n, or the activity of the gene product of the gene and determine 

abherrant expression or activity. 

Techniques such as Northern analysis, in situ hybridization or expression profiling can 
be used to further verify the association of these genes with obesity. In one example, a 
reporter-based assay may be devised to detect whether an identified obesity susceptibility 
gene has a different transcription levels and/or message stability compared to the same gene 
in a person not susceptibile to obesity. Individuals who carry the obesity susceptibility gene 
may exhibit differences in their ability to regulate protein biosynthesis under different 
physiological conditions and may display altered physiological abilities. 

In another example, the level of obesity gene expression can be assayed by detecting 
and measuring obesity transcription. For example, RNA from adipocytes from a person 
carrying the translocation can be isolated and tested utilizing hybridization or PCR 
techniques. Such analyses can reveal both quantitative and qualitative aspects of the 
expression pattern of the obesity gene, including activation or inactivation of obesity gene 
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expression. 
Pathway mapping 

Also within the invention is the identification and elucidation of the obesity 
biochernical\signal transduction pathway of which the obesity susceptibility protein is a 
component. In particular, the invention includes identifying the other components of the 
obesity susceptibility protein biochemical pathway. In this way it is possible to identify the 
specific critical signaling pathway which links the disease stimulus to the cell's response and 
enables the identification of new potential targets for therapy intervention. 

As used herein, a obesity susceptibility protein "target molecule" is a molecule in the 
obesity biochemical pathway with which the obesity susceptibility protein binds or interacts, 
directly or indirectly, with, or is a molecule that regulates the expression of the obesity gene 
or translation of the protein in nature. For example, the target molecule can be a protein - 
which directly interacts with the obesity susceptibility protein, or can be a protein which does 
not itself directly interact with the obesity susceptibility protein, but which is a component of 
the obesity susceptibility protein biochemical pathway. In one example, the target molecule 
can be an intercellular protein or a protein which facilitates the association of downstream or 
upstream signaling molecules with the obesity susceptibility protein. 

According to a further aspect of the invention there is provided the use of the obesity 
susceptibility protein in research to identify further gene targets implicated in obesity. 
Methods for identifying proteins which interact with the obesity susceptibility protein are 
known in the art, e.g., the two-hybrid assay (see, e.g., U.S. Pat. No. 5,283,317; Zervos et al. 
Cell 72:223-232, 1993) or using cell culture techniques to identify binding partners. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 
two different DNA constructs. In one construct, the gene that codes for an obesity 
susceptibility protein is fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of 
DNA sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene 
that codes for the activation domain of the known transcription factor. If the "bait" and the 
"prey" proteins are able to interact, in vivo, forming an obesity susceptibility protein - 
dependent complex, the DNA-binding and activation domains of the transcription factor are 
brought into close proximity. This proximity allows transcription of a reporter gene (e.g.; 
LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
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transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be isolated and used to obtain the cloned 

gene which encodes the "target" protein which interacts with the obesity susceptibility 

protein. 

Alternatively, binding partners for the obesity susceptibility protein can be identified 
using cell culture techniques or using cells obtained directly from an obesity patient. The 
method includes isolating the obesity susceptibility protein of interest from the cell and 
determining the identity of its target molecule. Initial screening can be accomplished by 
Western blot analysis to analyse immunochemically, using antibodies against the obesity 
susceptibility protein, the size of the obesity susceptibility protein -target molecule complex. 
Further analysis of the complex will reveal the identity of the target molecule. 

The gene and protein encoded by the target molecule is also a potential target for 
therapeutic intervention in obesity disease, for instance in the development of antisense 
nucleic acid targeted to the mRNA; or more widely in the identification or development of 
chemical or hormonal therapeutic agents. The person skilled in the art is also capable of 
devising screening assays to identify compounds (chemical or biological) that modulate 
(activate or inhibit) the identified gene, which compounds may prove useful as therapeutic 
agents in treating or preventing obesity. 

RORal-UNC5C and ROR06 nucleic acid sequence and polypeptide 

The invention encompasses a RORcd-UNC5C nucleic acid sequence having at least 
65%, 70%, 75%, 80%, 85%, 90% or 99%, sequence identity to the RORal-UNC5C nucleic 
acid sequence of SEQ ID NO: 1 or SEQ ID NO: 3. The invention also encompasses a 
RORo5 nucleic acid sequence having at.least 65%, 70%, 75%, 80%, 85%, 90% or 99%, 
sequence identity to the RORa5 nucleic acid sequence of SEQ ID NO: 7. The comparison of 
sequences and determination of percent sequence identity between two sequences can be 
accomplished using a mathematical algorithm In a preferred embodiment, the percent 
identity between two amino acid sequences is determined using the Needleman and Wunsch 
(J. Mol. Biol. (48):444-453 (1970)) algorithm which has been incorporated into the GAP 
program in the GCG software package (available at http://www.gcg.com), using either a 
Blosum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a 
length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity 
between two nucleotide sequences is determined using the GAP program in the GCG •software 
package (available at ttp://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight 
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of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the 
percent identity between two amino acid or nucleotide sequences is determined using the 
algorithm of E. Meyers and W. Miller (CABIOS, 4:11-17 (1989)) which has been 
incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a 
gap length penalty of 12 and a gap penalty of 4. 

The invention also encompasses polynucleotides which encode the RORal-UNC5C 
protein of SEQ ID NO: 2 or SEQ ID NO: 4, or the RORa5 protein of SEQ ID NO: 8. It will 
be appreciated by those skilled in the art that as a result of the degeneracy of the genetic code, 
a mulfitudeof nucleotide sequences encoding RORal-UNC5C polypeptide or RORa5 
polypeptide, some bearing minimal homology to the nucleotide sequences of any known and 
naturally occurring gene, may be produced. Thus, the invention contemplates each and every 
possible variation of nucleotide sequence that could be made by selecting combinations based 
on possible codon choices. These combinations are made in accordance with the standard 
triplet genetic code as applied to the nucleotide sequence of naturally occurring RORal- 
UNC5C polypeptide or RORa5 polypeptide, and all such variations are to be considered as 
being specifically disclosed. 

The invention also encompasses production of DNA sequences, or fragments thereof, 
which encode lRORal-UNC5C, sRORal-UNC5C or RORa5 polypeptide and their 
derivatives, entirely by synthetic chemistry. After production, the synthetic sequence may be 
inserted into any of the many available expression vectors and cell systems using reagents that 

are well known in the art. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed RORal-UNC5C or RORoc5 nucleic acid seqeunce, and in 
particular, those shown in SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7, under various 
conditions of stringency as taught in Wahl, G. M. and S. L. Berger (1987; Methods Enzymol. 
152:399-407) and Kimmel, A. R. (1987; Methods Enzymol. 152:507-511). Appropriate 
stringency conditions which promote DNA hybridization, for example, 6.0 X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 X SSC at 50°C, are 
known to those skilled in the art or can be found in Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in the wash 
step can be selected from a low stringency of about 2.0 SSC at 50°C to a high stringency of 
about 0.2 X SSC at 50°C. I n addition, the temperature in the wash step can be increased from 
low stringency conditions at room temperature, about 22° C, to high stringency conditions at 



100852 



-17- 



) 



about 65°C. Moderately stringent conditions are, for example at about 2.0 X SSC and about 
40°C. 

Also included in the invention are RORal-UNC5C polypeptides having at least 65% 
amino acid sequence identity to the RORal-UNCSC polypeptide of SEQ ID NO: 2 or SEQ 
ID NO: 4 and which retains at least one biological Or other functional characteristic or activity 
of the RORal-UNCSC polypeptide. A most preferred RORal-UNC5C variant is one having 
at least 65% 70%, 80% or 90% amino acid sequence identity to SEQ ID NO: 2or SEQ ID NO: 

4. ' 

Also included in the invention is RORo5 polypeptide having at least 65% amino acid 

sequence identity to the RORoS polypeptide of SEQ ID NO: 8 and which retains at least one 

biological or other functional characteristic or activity of the RORoS polypeptide. A most 

preferred RORoS variant is one having at least 65% 70%, 80% or 90% amino acid sequence 

identity to SEQ ID NO: 8. 

The invention also includes variants of the RORccl-UNC5C polypeptide or the 
RORo5 polypeptide which can contain one or more substitutions of amino acid residues 
which result in a silent change and a functionally equivalent RORal-UNCSC polypeptide or 
RORa5 polypeptide respectively. Deliberate amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues as long as the biological or immunological activity of 
RORal-UNC5C polypetide or RORa5 polypeptide is retained. For example, negatively 
charged amino acids may include aspartic acid and glutamic acid; positively charged amino 
acids may include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values may include leucine, isoleucine, and valine, glycine and 
alanine, asparagine and glutamine, serine and threonine, and phenylalanine and tyrosine. 
Obesity Proteins and Polypeptides 

• Obesity proteins such as the RORal, RORa5, UNC5C, sUNC5C or the RORal- 
UNCSC fusion proteins can be prepared for a variety of uses, including but not limited to the 
generation of antibodies, as reagents in diagnostic assays, in the identification of other cellular 
gene products involved in the pathogenesis of obesity, as reagents in assays for screening for 
compounds that can be used in the treatment of obesity, and as pharmaceutical reagents useful 

in the treatment of obesity. 

A variety of host-expression vector systems may be utilized to express the obesity 
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nucleotide sequences of the invention. The expression systems that may be used for 
purposes of the invention include but are not limited to microorganisms such as bacteria 
(e.g., E. coli, B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA or 
cosmid DNA expression vectors containing obesity nucleotide sequences or mammalian cell 
systems (e.g., COS, CHO, BHK, 293, 3T3) harbouring recombinant expression constructs 
containing promoters derived from the genome of mammalian cells (e.g., metallothionein 
promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the vaccinia virus 
7. 5K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the obesity gene product being expressed For example, 
when a large quantity of such a protein is to be produced,for the generation of pharmaceutical 
compositions of obesity protein or for raising antibodies to the obesity protein, for example, 
vectors which direct the expression of high levels of fusion protein products that are readily 
purified may be desirable. Such vectors include, but are not limited to, the E. coli expression 
vector pUR278 (Ruther et al, EMBO J. 2:1791, 1983), in which the obesity coding sequence 
may be ligated individually into the vector in frame with the lacZ coding region so that a 
fusion protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids Res. 13:3101-3109, 
1985; Van Heeke & Schuster, J. Biol. Chem. 264:5503-5509, 1989); and the like. PGEX 
vectors may also be used to express foreign polypeptides as fusion proteins with glutathione 
S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysed cells by adsorption to glutathione-agarose beads followed by elution in the 
presence of free glutathione: The PGEX vectors are designed to include thrombin or factor Xa 
protease cleavage sites so that the cloned target gene product can be released from the GST 
moiety. 

In mammalian host cells, a number of viral-based expression systems may be utilized. 
In cases where an adenovirus is used as ah expression vector, the obesity nucleotide sequence 
of interest may be ligated to an adenovirus transcription/translation control complex, e.g., the 
late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of 
the viral genome (e.g., region El or E3) will result in a recombinant virus that is viable and 
capable of expressing the obesity gene product in infected hosts. (Eg- See Logan & Shenk, 
Proc. Natl. Aobesity. Sci. USA 81:3655-3659,1984). . 

For long-term, high-yield production of recombinant proteins, stable expression is 
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prefened. For example, cell lines which stably express the obesity sequences described 
above may be engineered. Rather than using expression vectors which contain viral origins 
of replication, host cells can be transformed with DNA controlled by appropriate expression 
control elements, (e.g., promoter, enhancer sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker, Following the introduction of the foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, and then 
are switched to a selective media. The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably integrate the plasmid into their 
chromosomes and grow to form'foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines which express the obesity 
gene product. Such engineered cell lines may be particularly useful in screening and 
evaluation Of compounds that affect the endogenous activity of the obesity gene product. 

Alternatively, any fusion protein may be readily purified by utilizing an antibody 
specific for the fusion protein being expressed. For example, a system described by Janknecht 
et al allows for the ready purification of non-denatured fusion proteins expressed in human 
cell lines (Janknecht, et al, Proc. Natl. Aobesity. Sci. USA 88:8972-8976, 1991). In this 
system, the gene of interest is subcloned into a vaccinia recombination plasmid such that the 
gene's open reading frame is translationally fused to an amino-terminal tag consisting of six 
histidine residues. Extracts from cells infected with recombinant vaccinia virus are loaded 
onto Ni.2+. nitriloacetic acid-agarose columns and histidine-tagged proteins are selectively 
eluted with imidazole-containing buffers. 

The obesity gene products can also be expressed in transgenic animals. Animals of any 
species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, goats, 
and non-human primates, for example, baboons, monkeys, and chimpanzees may be used to 
generate obesity transgenic animals. 

Any technique known in the art may be used to introduce the obesity transgene into 
animals to produce the founder lines of transgenic animals. Such techniques include, but are 
not limited to pronuclear microinjection (U.S. Pat. No. 4,873,191); retrovirus mediated gene 
transfer into germ lines (Van der Putten et al , Proc. Natl. Aobesity. Sci. USA 82:6148-6152, 
1985); gene targeting in embryonic stem cells (Thompson et al, Cell 56:313-321, 1989); 
electroporation of embryos (Lo, Mol. Cell. Biol. 3:1803-1814, 1983); and sperm-mediated 
gene transfer (Lavitrano et al, Cell 57:717-723, 1989); etc. For a review of such techniques, 
see Gordon, Transgenic Animals, Intl. Rev. Cytol. 115:171-229, 1989, which is incorporated 
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by reference herein in its entirety. 

The present invention provides for transgenic animals that carry the obesity transgene 
in all their cells, as well as animals which carry the transgene in some, but not all their cells, 
i.e., mosaic animals. The transgene may be integrated as a single transgene or in concatamers, 
e.g., head-to-head tandems or head-to-tail tandems. The transgene may also be selectively 
introduced into and activated in a particular cell type by following, for example, the teaching 
of Lasko et al (Lasko, M. et al, Proc. Natl. Aobesity. Sci. USA 89:6232-6236, 1992). Once 
transgenic animals have been generated, the expression of the recombinant obesity gene may 
be assayed utilizing standard techniques. Initial screening may be accomplished by Southern 
blot analysis or PCR techniques to analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA expression of the transgene in the tissues of the 
transgenic animals may also be assessed using techniques which include but are not limited to 
Northern blot analysis of tissue samples obtained from the animal, in situ hybridization 
analysis, and RT-PCR. Samples of obesity gene-expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the obesity transgene product. 
Antibodies to obesity Proteins 

Antibodies that specifically recognize one or more epitopes of obesity proteins, or 
peptide fragments, are also encompassed by the invention. Such antibodies include but are not 
limited to polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric 
antibodies, single chain antibodies, Fab fragments, F(ab').2 fragments, fragments produced by 
a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of 
any of the above. 

The antibodies of the invention may be used, for example, in the detection of the 
obesity in a biological sample and may, therefore, be utilized as part of a diagnostic or 
prognostic technique whereby patients may be tested for abnormal amounts of obesity. Such 
antibodies may also be utilized in conjunction with, for example, compound screening 
schemes for the evaluation of the effe6t of test compounds on expression and/or activity of the 
obesity gene product. Additionally, such antibodies can be used in conjunction with the gene 
therapy techniques described, for example, evaluate the normal and/or engineered obesity - 
expressing cells prior to their introduction into the patient. Such antibodies may additionally 
be used as a method for the inhibition of abnormal obesity activity. Thus, such antibodies 
may, therefore, be utilized as part of obesity treatment methods. 

Methods of making and detecting labelled antibodies are well known (Campbell; 
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Monoclonal Antibody Technology, in: Laboratory Techniques in Biochemistry and 
Molecular Biology, Volume 13, Eds: Burdon R et al. Elsevier, Amsterdam (1984)). The 
term antibody includes both monoclonal antibodies, which are a substantially homogeneous 
population, and polyclonal antibodies which are heterogeneous populations. The term also 
includes inter alia, humanised and chimeric antibodies. Monoclonal antibodies to specific 
antigens may be obtained by methods known to those skilled in the art, such as from 
hybridoma cells, phage display libraries or other methods. Monoclonal antibodies may be 
inter alia, human, rat or mouse derived. For the production of human monoclonal antibodies, 
hybridoma cells may be prepared by fusing spleen cells from an immunised animal, e.g. a 
mouse, with a tumour cell. Appropriately secreting hybridoma cells may thereafter be 
selected (Koehler & Milstein, Nature 256:495-497 (1975); Cole et al, "Monoclonal 
antibodies and Cancer Therapy", Alan R Liss Inc, New York N.Y. pp 77-96 (1985)). Such 
antibodies maybe of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any 
subclass thereof. 

Polyclonal antibodies can be generated by immunisation of an animal (such as a 
mouse, rat, goat, horse, sheep etc) with an antigen, such as one of the obesity proteins used in 
this invention. 

Rodent antibodies may be humanised using recombinant DNA technology according 
to techniques known in the art. Alternatively, chimeric antibodies, single chain antibodies, 
Fab fragments may also be developed against the polypeptides of the invention (Huse et al, 
Science 256:1275-1281 (1989)), using skills known in the art. Antibodies so produced have a 
number of uses which will be evident to the molecular biologist or immuhologist skilled in the 
art. Such uses include, but are not limited to, monitoring enzyme expression, development of 
assays to measure enzyme activity and use as a therapeutic agent. Enzyme linked 
immunosorbant assays (ELIS As) are well known in the art and would be particularly suitable 
for detecting the obesity protein or polypeptide fragments thereof in a test sample. . 

Screening Assay 

The invention also provides a method for identifying modulators, i.e., test compounds 
(e.g., peptides, peptidomimetics, small molecules or other drugs) which modulate the activity 
of an obesity susceptibility gene. 

In one example, the invention provides methods for screening for test compounds for 
use in the treatment of obesity by screening for'test compounds that modulate the activity of 
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the obesity protein, or a portion thereof. The test compounds of the present invention can be 
obtained using any of the numerous approaches in combinatorial library methods known in 
the art, including: biological libraries. The method can be a celi-based method or a cell free 
method. The screening methods according to the invention may be operated using 
conventional procedures, for example by bringing the test compound or compounds to be 
screened and an appropriate substrate into contact with the obesity polypeptide, or a cell 
capable of producing it, or a cell membrane preparation thereof, and determining affinity for 
the obesity polypeptide in accordance with standard techniques. 

Any compound identified in this way may prove useful in the treatment of obesity in 
humans and/or other animals. The invention thus extends to a compound selected through its 
ability to regulate the activity, of the obesity protein in vivo as primarily determined in a 
screening assay utilising an obesity polypeptide or a homologue or fragment thereof, or a gene 
coding therefore for use in the treatment of a disease in which the over- or under-activity or 
unregulated activity of the protein is implicated. 

° According to a further aspect of the invention there is provided a screening assay or 
method for identifying potential anti- obesity therapeutic compounds comprising contacting 
an assay system capable of detecting the effect of a test compound against expression level of 
obesity, with a test compound and assessing the change in expression level of obesity. 
Compounds that modulate the expression of DNA or RNA of obesity polypeptides may be 
detected by a variety of assay systems. A suitable assay system may be a simple "yes/no- 
assay to determine whether there is a change in expression of a reporter gene, such as beta- 
galactosidase, luciferase, green fluorescent protein or others known to the person skilled in the 
art (reviewed by Naylor, Biochem. Pharmacol. 58:749-57,1999). The assay system may be 
made quantitative by comparing the expression or function of a test sample with the levels of 
expression or function in a standard sample. Systems in which transcription factors are used 
to stimulate a positive output, such as transcription of a reporter gene; are generally referred to 
as "one-hybrid systems" (Wang, M.M. and Reed, R.R. Nature 364:121-126, 1993). Using a 
transcription factor to stimulate a negative output (growth inhibition) may thus be referred to 
as a "reverse one-hybrid system" (Vidal et al, supra). Therefore, in an embodiment of the 
present invention, a reporter gene is placed under the control of the obesity promoter. 

In a further aspect of the invention we provide a cell or cell line comprising a reporter 
gene under the control of the obesity promoter. 
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According to another aspect of the present invention there is provided a method of 
screening for a compound potentially useful for treatment of obesity which comprises 
assaying the compound for its ability to modulate the activity or amount of obesity. 
Preferably the assay is selected from: 

i) measurement of obesity activity using a cell line which expresses the obesity 
polypeptide or using purified obesity polypeptide; and 

ii) measurement of obesity transcription or translation in a cell line expressing the 
obesity polypeptide. 

' Thus, in a further aspect of the invention, cell cultures expressing the obesity polypeptide 
can be used in a screen for therapeutic agents. Effects of test compounds may be assayed by 
changes in mRNA or protein of obesity. As described above, cells (i.e. mammalian, bacterial 
etc) can be engineered to express the obesity polypeptide. 

Thus, according to a further aspect of the invention there is provided a method of testing 
potential therapeutic agents for the ability to suppress the obesity phenotype comprising 
; contacting a test compound with a cell engineered to express the obesity polypeptide; and 
determining whether said test compound suppressed expression of the obesity polypeptide. 

We also provide a method for identifying inhibitors of transcription of obesity, which 
method comprises contacting a potential therapeutic agent with a cell or cell line as described 
above and determining inhibition of obesity transcription by the potential therapeutic agent by 
o reference to a lack of or reduction in expression of the reporter gene. 

Any convenient test compound or library of test compounds may be used in conjunction 
with the test assay. Particular test compounds include low molecular weight chemical 
compounds (preferably with a molecular weight less than 1500 daltons) suitable as 
pharmaceutical or veterinary agents for human or animal use, or compounds for non- 
* administered use such as cleaning/sterilising agents or for agricultural use. Test compounds 
may also be biological in nature, such as antibodies. 

According to a further aspect of the invention there is provided a compound identified by 

a screening method as defined herein. 

According to another aspect of the present invention there is provided use of a compound 
30 able to modulate the activity or amount of obesity in the preparation of a medicament for the 
treatment of obesity. Modulation of the amount of obesity by a compound may be brought 
about for example through altered gene expression level or message stability. Modulation of 
the activity of obesity by a compound may also be brought about for example through 
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compound binding to the obesity protein. In one embodiment, modulation of obesity 
comprises use of a compound able to reduce the activity or amount of obesity. In another 

embodiment, modulation of obesity comprises use of a compound able to increase the activity 

or amount of obesity. 

Diagnostic test 

To determine if an individual is suceptible to obesity determination of the presence of 
the translocation junction between chromosome 4q22,3 and chromosome 15q22.2 can be 
performed to predict whether the individual has a high risk obesity chromosome. Moreover, 
determination of the translocation breakpoint can be used to identify the underlying cause of 

obesity in a person who is obese. 

A diagnostic test method lends itself readily to the formulation of kits which can be 
utilized in diagnosis. The diagnostic kits may comprise appropriate packaging and 
instructions for use in the methods of the invention. Such kits may further comprise 
appropriate buffer(s) and polymerase(s) such as thermostable polymerases, for example taq 
polymerase. 

Methods for the diagnosis of obesity may, for example, utilize reagents such as the 
nucleotide sequences which span the breakpoint of sRORal -UNC5C or IRORal -UNC5C 
antibodies. Specifically, such reagents may be used, for example, for: (1) the detection of the 
presence of short or long RORal -UNC5C transcript, (2) the detection of perturbations or 
abnormalities in the signal transduction pathway mediated by RORal -UNC5C or UNC5C. 

For the detection of mutations, any nucleated cell can be used as a starting source for 
genomic nucleic acid. For the detection of an obesity susceptilibity gene such as sRORal - 
UNC5C or 1RORCC1-UNC5C gene products, any cell type or tissue in which the obesity gene 

is expressed can be used. 

Detection of the obesity Gene and Transcripts 

Obesity susceptibility gene transcripts can be detected by utilizing a number of 
techniques. Nucleic acid from any nucleated cell can be used as the starting point for such 
assay techniques, and may be isolated according to standard nucleic acid preparation 
procedures which are well known to those of skill in the art. 
o DNA may be used in hybridization or amplification assays of biological samples to detect 

abnormalities involving obesity gene structure or gene regulatory elements, including point 
mutations, insertions, deletions and chromosomal rearrangements. Such assays may include, 
but are not limited to, Southern analyses, single stranded conformational polymorphism 
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analyses (SSCP), and PCR analyses: 

Such diagnostic methods for the detection of obesity gene-specific mutations can 
involve for example, contacting and incubating nucleic acids including recombinant DNA 
molecules, cloned genes or degenerate variants thereof, obtained from a sample, for example, 
derived from a patient sample or other appropriate cellular source, with one or more labeled 
nucleic acid reagents including recombinant DNA molecules, under conditions favorable for 
the specific annealing of these reagents to their complementary sequences within the obesity 
gene. Preferably, the lengths of these nucleic acid reagents are at least 15 to 30 nucleotides. 
After incubation, all non-annealed nucleic acids are removed from the nucleic acid: obesity 
molecule hybrid. The presence of nucleic acids which have hybridized, if any such molecules 
exist, is then detected. Using such a detection scheme, the nucleic acid from the cell type or 
tissue of interest can be irnmobilized, for example, to a solid support such as a membrane, or a 
plastic surface such as that on a microliter plate or polystyrene beads. Detection of the 
remaining, annealed, labeled obesity nucleic acid reagents is accomplished using standard 
techniques well-known to those in the art. The obesity gene sequences to which the nucleic 
acid reagents have annealed can be compared to the annealing pattern expected from a normal 
obesity gene sequence in order to determine whether an obesity gene mutation is present. 
Detection of the obesity Gene Products 

Antibodies directed against wild type or mutant obesity gene products or conserved 
variants or peptide fragments thereof, can also be used as obesity diagnostics. Such 
diagnostic methods, can be used to detect abnormalities in the level of obesity gene 
expression, or abnormalities in the structure and/or temporal, tissue, cellular, or subcellular 
location of the obesity, and may be performed in vivo or in vitro, such as, for example, on 
biopsy tissue. 

For example, antibodies directed to epitopes of the obesity can be used in vivo to detect 
the pattern and level of expression of the obesity in the body. Such antibodies can be labeled, 
and injected into a subject in order to visualize binding to the obesity expressed in the body 
using methods such as X-rays, CAT-scans, or 1VERI- 

Additionally, any obesity fusion protein or obesity conjugated protein whose presence 
, can be detected, can be administered. For example, obesity fusion or conjugated proteins 
labeled with a radio-opaque or other appropriate compound can be administered and 
visualized in vivo, as discussed, above for labeled antibodies. 
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Alternatively, immunoassays or fusion protein detection assays, as described above, can 
be utilized on biopsy and autopsy samples in vitro to permit assessment of the expression 
pattern of the obesity. Such assays are not confined to the use of antibodies that define the 
obesity, but can include the use of antibodies directed to epitopes of any region of the obesity. 

The tissue or cell type to be analyzed will generally include those which are known, or 
suspected, to express the obesity gene. The protein isolation methods employed herein can, 
for example, be such as those described in Harlow and Lane (Harlow, E. and Lane, D., 1988, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y.), which is incorporated herein by reference in its entirety. The isolated cells can 
be derived from cell culture or from a patient. The analysis of cells taken from culture may be 
a necessary step in the assessment of cells that could be used as part of .a cell-based gene 
therapy technique or, alternatively, to test the effect of compounds on the expression of the 
obesity gene. 
Gene therapy 

The invention encompasses gene therapy methods and compositions for treating and 
preventing obesity. 

In one example, the loss of normal obesity gene product function can result in the 
development of obesity. Increasing obesity gene product activity, or activation of the obesity 
pathway (e.g., downstream activation) would therefore facilitate progress in individuals 
exhibiting a deficient level of obesity gene expression and/or obesity activity. 

Alternatively, obesity may be ameliorated by decreasing the level of obesity gene 
expression, and/or obesity gene activity, and/or downregulating activity of the obesity 
pathway (e.g., by targeting downstream signaling events). Different approaches are discussed 
below. 

i Inhibition of obesity Expression or obesity Activity 

Any method that neutralizes or inhibits expression of the obesity gene (either 
transcription or translation), e.g. RORocl, RORa5 and RORal-TJNC5C, can be used to 

prevent or treat obesity. 

For example, the administration of soluble peptides, proteins, fusion proteins, or 
o antibodies (including anti-idiotypic antibodies) that bind to and "neutralize" circulating 
obesity can be used. Such obesity neutralizing peptides, proteins, fusion proteins, anti- 
idiotypic antibodies or Fabs are administered to a subject in amounts sufficient to treat or 
prevent obesity. 
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In an alternate embodiment, therapy can be designed to reduce the level of endogenous 
obesity gene expression, for example, using antisense or ribozyme approaches to inhibit or 
prevent translation of obesity mRNA-transcripts; triple helix approaches to inhibit 
transcription of the obesity gene; or targeted homologous recombination to inactivate or 
"knock out" the obesity gene or its endogenous promoter. Antisense, ribozyme or DNA 
constructs can be administered directly to the site containing the target cells; or can be 

directed to the target cells. 

Antisense approaches involve the design of oligonucleotides (either DNA or RNA) that 
are complementary to obesity mRNA. The antisense oligonucleotides will bind to the 
complementary obesity mRNA transcripts and prevent translation. Absolute complementarity, 
although preferred, is not required. A sequence "complementary" to a portion of an RNA, as 
referred to herein, means a sequence having sufficient complementarity to be able to 
hybridize with the RNA, forming a stable duplex; in the case of double-stranded antisense 
nucleic acids , a single strand of the duplex DNA may thus be tested, or triplex formation may 
be assayed. The ability to hybridize will depend on both the degree of complementarity and 
the length of the antisense nucleic acid.. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with an RNA it may contain and still form a stable duplex (or triplex, 
as the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by use 
of standard procedures to cieterrnine the melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the message, for example, the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
efficiently at inhibiting translation. However, sequences complementary to the 3' untranslated 
sequences of mRNAs have been shown to be effective at inhibiting translation of mRNAs as 
. well. Thus, oligonucleotides complementary to either the 5'- or 3 '-non-translated, non-coding 
regions of the obesity can be used in an antisense approach to inhibit translation of 
endogenous obesity mRNA. Oligonucleotides complementary to the 5' untranslated region of 
the mRNA should include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mRNA coding regions are less efficient inhibitors of 
translation but could be used in accordance with the invention. Whether designed to hybridize 
to the 5'-, 3'- or coding region of obesity mRNA, antisense nucleic acids should be at least six 
nucleotides in length, and are preferably oligonucleotides ranging from 6 to about 50 
nucleotides in length. In specific aspects, the oligonucleotide is at least 10 nucleotides, at least 
17 nucleotides, at least 25 nucleotides, or at least 50 nucleotides in length. 
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Regardless of the choice of target sequence, it is preferred that in vitro studies are first 
performed to quantitate the ability of the antisense oligonucleotide to inhibit gene 
expression. It is preferred that these studies utilize controls that distinguish between antisense 
gene inhibition and nonspecific biological effects of oligonucleotides. It is also preferred that 
these studies compare levels of the target RNA or protein with that of an internal control RNA 
or protein. Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control oligonucleotide. It is 
preferred that the control oligonucleotide is of approximately the same length as the test 
oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from the 
antisense sequence no more than is necessary to prevent specific hybridization to the target 
sequence. 

While antisense nucleotides complementary to the obesity coding region sequence could 
be used, those complementary to the transcribed untranslated region are most preferred. The 
antisense molecules should be delivered to cells which express the obesity in vivo. A number 
of methods have been developedfor delivering antisense DNA or RNA to cells; for example, 
antisense molecules can be injected dixectly into the tissue site, or modified antisense 
molecules, designed to target the desired cells (e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on the target cell surface) can be 

administered systemically. 

Ribozyme molecules designed to catalytically cleave obesity mRNA transcripts can also 
be used to prevent translation of obesity mRNA and expression of obesity. (See, e.g., PCT 
International Publication WO90/1 1364, published Oct. 4, 1990; Sarver et al, 1990, Science 
247:1222-1225). While ribozymes that cleave mRNA at site.specific recognition sequences 
can be used to destroy obesity mRNAs, the use of hammerhead ribozymes is preferred. 
Hammerhead.ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5'-UG-3'. The construction and production 
of hammerhead ribozymes is well known in the art and is described more fully in Haseloff 
and Geriach, 1988, Nature 334:585-591. There are hundreds of potential hammerhead 
ribozyme cleavage sites within the nucleotide sequence of human obesity cDNA. Preferably 
the ribozyme is engineered so that the cleavage recognition site is located near the 5' end of 
the OBESITY mRNA; i.e., to increase efficiency and minimize the intracellular accumulation 
of non-functional mRNA transcripts. 
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The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena 
Thermophila (known as the IVS, or L-19 IVS RNA) and which has been extensively 
described by Thomas Cech and collaborators (Zaug, et al, 1984, Science 224:574-578; Zaug 
and Cech, 1986, Science 231:470-475; Zaug, et al, 1986, Nature 324:429-433; published 
International patent application No. WO 88/04300 by University Patents Inc.; Been and Cech, 
1986, Cell 47:207-216). The Cech-type ribozymes have an eight basepair active site that 
hybridizes to a target RNA sequence, whereafter cleavage of the target RNA takes place. The 
invention encompasses those Cech-type ribozymes that target eight basepair active site 
sequences that are present in obesity. 

As in the antisense approach, the ribozymes can be composed of modified 
otigonucleotides (e.g. for improved stability, targeting, etc.) and should be delivered to cells 
which express the obesity in vivo. A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme under the control of a strong constitutive pol m or pol H 
promoter, so that transfected cells will produce sufficient quantities of the ribozyme to destroy 
endogenous obesity messages and inhibit translation. Because ribozymes, unlike antisense 
molecules, are catalytic, a lower intracellular concentration is required for efficiency. 

In an alternative embodiment for neutralizing circulating obesity, cells that are 
genetically engineered to express such soluble or secreted forms of obesity can be 
administered to a patient, whereupon they will serve as "bioreactors" in vivo to provide a 
continuous supply of the neutralizing protein. 

Endogenous obesity gene expression can also be reduced by inactivating or "knocking 
out" the obesity gene or its promoter using targeted homologous recombination. (E.g., see 
Smithies et al, 1985, Nature 317:230-234; Thomas & Capecchi, 1987, Cell 51:503-512; 
Thompson et al, 1989, Cell 5:313-321; each of which is incorporated by reference herein in 
its entirety). 

In yet another embodiment of the invention, the activity of obesity can be reduced using a 
"dominant negative" approach to effectuate obesity. To this end, constructs that encode 
defective obesity can be used in gene therapy approaches to diminish the activity of the 
> obesity in appropriate target cells. 

Pharmaceutical composition containing the Identified Candidate Compound 

The compounds having the desired activity may be administered in a physiologically 
acceptable carrier to an obesity patient. Such compositions of the invention may be in a form 
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suitable for oral use (for example as tablets, lozenges, hard or soft capsules, aqueous or oily 
suspensions, emulsions, dispersible powders or granules, syrups or elixirs), for topical use 
(for example as creams, ointments, gels, or aqueous or oily solutions or suspensions), for 
administration by inhalation (for example as a finely divided powder or a liquid aerosol), for 
administration by insufflation (for example as a finely divided powder) or for parenteral 
administration (for example as a sterile aqueous or oily solution for intravenous, 
subcutaneous, intramuscular or intramuscular dosing or as a suppository for rectal dosing). 

The compositions of the invention may be obtained by conventional procedures using 
conventional pharmaceutical excipients, well known in the art. Thus, compositions intended 
for oral use may contain, for example, one or more colouring, sweetening, flavouring and/or 
preservative agents. 

Suitable pharmaceutic ally acceptable excipients for a tablet formulation include, for 
example, inert diluents such as lactose, sodium carbonate, calcium phosphate or calcium 
carbonate, granulating and disintegrating agents such as com starch or algenic acid; binding 
agents such as starch; lubricating agents such as magnesium stearate, stearic acid or talc; 
preservative agents such as ethyl or propyl p-hydroxybenzoate, and anti-oxidants, such as 
ascorbic acid. Tablet formulations may be uncoated or coated either to modify their 
disintegration and the subsequent absorption of the active ingredient within the 
gastrointestinal track, or to improve their, stability and/or appearance, in either case, using 
conventional coating agents and procedures well known in the art. 

Compositions for oral use may be in the form of hard gelatin capsules in which the active 
ingredient is mixed with an inert solid diluent, for example, calcium carbonate, calcium 
phosphate or kaolin, or as soft gelatin capsules in which the active ingredient is mixed with 
water or an oil such as peanut oil, liquid paraffin, or olive oil. 

' Aqueous suspensions generally contain the active ingredient in finely powdered form 
together with one or more suspending agents, such as sodium carboxymethylcellulose, 
methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinyl-pyrrolidone, gum 
tragacanth and gum acacia; dispersing or wetting agents such as lecithin or condensation 
products of an alkylene oxide with fatty acids (for' example polyoxethylene stearate), or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 
heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 
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heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products of ethylene oxide with partial esters derived from fatty acids and 
hexitol anhydrides, for example polyethylene sorbitan monooleate. The aqueous suspensions 
may also contain one or more preservatives (such as ethyl or propyl p-hydroxybenzoate, anti- 
oxidants (such as ascorbic acid), colouring agents, flavouring agents, and/or sweetening 
agents (such as sucrose, saccharine or aspartame). 

Oily suspensions may be formulated by suspending the active ingredient in a vegetable 
oil (such as arachis oil, olive oil, sesame oil or coconut oil) or in a mineral oil (such as liquid 
paraffin). The oily suspensions may also contain a thickening agent such as beeswax, hard 
paraffin or cetyl alcohol. Sweetening agents such as those set out above, and flavouring, 
agents may be added to provide a palatable oral preparation. These compositions may be 
preserved by the addition of an anti-oxidant such as ascorbic acid. 

Dispersible powders and granules suitable for preparation of an aqueous suspension by 
the addition of water generally contain the active ingredient together with a dispersing or 
wetting agent, suspending agent and one or more preservatives. Suitable dispersing or 
wetting agents and suspending agents are exemplified by those already mentioned above. 
Additional excipients such as sweetening, flavouring and colouring agents, may also be 
present. 

The pharmaceutical compositions of the invention may also be in the form of 
oil-in-water emulsions. The oily phase may be a vegetable oil, such as olive oil or arachis oil, 
or a mineral oil, such as for example liquid paraffin or a mixture of any of these. Suitable 
emulsifying agents may be, for example, naturally-occurring gums such as gum acacia or gum 
tragacanth, naturally-occurring phosphatides such as soya bean, lecithin, an esters or partial 
i esters derived from fatty acids and hexitol anhydrides (for example sorbitan monooleate) and 
condensationproducts of the said partial esters with ethylene oxide such as polyoxyethylene 
sorbitan monooleate. The emulsions may also contain sweetening, flavouring and 
preservative agents. 

Syrups and elixirs may be formulated with sweetening agents such as glycerol, propylene 
o glycol, sorbitol, aspartame or sucrose, and may also contain a demulcent, preservative, 
flavouring and/or colouring agent. 

The pharmaceutical compositions may also be in the form of a sterile injectable aqueous 
or oily suspension, which may be formulated according to known procedures using one or 
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more of the appropriate dispersing or wetting agents and suspending agents, which have 
been mentioned above. A sterile injectable preparation may also be a sterile injectable 

solution or suspension in a non-toxic parenterally-acceptable diluent or solvent, for example a 

solution in 1,3-butanediol. 

Suppository formulations may be prepared by mixing the active ingredient with a suitable 

non-irritating excipient which is solid at ordinary temperatures but liquid at the rectal 

temperature and will therefore melt in the rectum to release the drug. Suitable excipients 

include, for example, cocoa butter and polyethylene glycols. 

Topical formulations, such as creams, ointments, gels and aqueous or oily solutions or 

suspensions, may generally be obtained by formulating an active ingredient with a 

conventional, topically acceptable, vehicle or diluent using conventional procedure well 

known in the art. 

Compositions for administration by insufflation may be in the form of a finely divided 
powder containing particles of average diameter of, for example, 30^ or much less, the 
powder itself comprising either active ingredient alone or diluted with one or more 
physiologically acceptable carriers such as lactose. The powder for insufflation is then 
' conveniently retained in a capsule containing, for example, 1 to 50mg of active ingredient for 
use with a turbo-inhaler device, such as is used for insufflation of the known agent sodium 
cromoglycate.' 

Compositions for administration by inhalation may be in the form of a conventional 
pressurised aerosol arranged to dispense the active ingredient either as an aerosol containing 
finely divided solid or liquid droplets. Conventional aerosol propellants such as volatile 
fluorinated hydrocarbons or hydrocarbons may be used and the aerosol device is conveniently 
arranged to dispense a metered quantity of active ingredient. 

For further information on Formulation the reader is referred to Chapter 25.2 in Volume 5 
of Comprehensive Medicinal Chemistry (Corwin Hansen; Chairman of Editorial Board), 
PergamOn Press 1990. 

The amount of active ingredient that is combined with one or more excipients to produce 
a single dosage form will necessarily vary depending upon the host treated and the particular 
route of administration. For example, a formulation intended for oral administration to 
humans will generally contain, for example, from 0.5 mg to 2 g of active agent compounded 
With an appropriate and convenient amount of excipients which may vary from about 5 to 
about 98 percent by weight of the total composition. Dosage unit forms will generally contain 
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about 1 mg to about 500 mg of an active ingredient. For further information on Routes of 
Administration and Dosage Regimes the reader is referred to Chapter 25.3 in Volume 5 of 

Comprehensive Medicinal Chemistry (Corwin Hansen; Chairman of Editorial Board), 

Pergamon Press 1990. 

The size of the dose for therapeutic or prophylactic purposes of a compound will 

naturally vary according to the nature and severity of the conditions, the age and sex of the 

animal or patient and the route of administration, according to well known principles of 

medicine. 

In using a compound for therapeutic or prophylactic purposes it will generally be 
administered so that a daily dose in the range, for example, 0.5 mg to 75 mg per kg body 
weight is received,, given if required in divided doses. In general lower doses will be 
administered when a parenteral route is employed. Thus, for example, for intravenous 
administration, a dose in the range, for example, 0.5 mg to 30 mg per kg body weight will 
generally be used. Similarly, for administration by inhalation, a dose in the range, for 
example, 0.5 mg to 25 mg per kg body weight will be used. Oral administration is however 
preferred. 

It will be obvious to those skilled in the art to which the invention pertains, that 
various changes and modifications may be made without departing from the scope of the 
invention defined by the claims. 
EXAMPLES 

A male patient having a BMI of 48 was karyotyped and abnormal chromosomes were 
identified. It was found that the abnormal chromosomes had the translocation products 
between chromosomw 4q22 and chromosome 15q22. The patient's sister and mother also 
had abnormally high BMI, and were also karyotyped (Fig. 1). It was subsequently found that 
the sister and mother were also carrying the balanced translocation. The breakpoints of each 
chromosome were further mapped with fish probes and southern blots and the breakpoints on 
each chromosome were sequenced. Using bacterial artificial chromosomes (B AC) clones 
spanning both breakpoints, the breakpoints were sequenced and assembled with the help of 
the public sequence to generate ordered contigs of the breakpoints region of chromosome 4 
and 15. RORal on chromosome 15 was found to be affected since the breakpoint is within 
the first intron of isoform 1 resulting in the destruction of one copy of RORal. 
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Example 2 

Preadipocytes were isolated from the affected boy and girl and used to study adipocyte 
function. " Gene expression was monitored for all genes mapped to the vicinity of the 
breakpoints both at the preadipocyte stage and after induction into mature adipocytes. The 
isolated adipocytes showed an altered phenotype compared with normal adipocytes both in 
general morphology, function and gene expression. lipid'droplets were incorporated as tiny 
droplets instead of the large ones normally seen. UNC5C was highly unregulated in the 
preadipocytes both before and after induction into mature adipocytes when compared with an 
adipocyte control cell line. 

Two differently sized UNC5C transcripts could be detected in preadipocytes isolated 
from the patients. These were cloned and sequenced and shown to be a fusion between 
RORcd exon 1 andUNC5C exon 2 giving rise to a novel protein. Both encoded a fusion 
transcript with the structure RORal-UNC5C fusion (SEQ ID NO: 1 and SEQ ID NO: 3). The 
shorter of the two(SEQ ID NO: 3) is a splice version using alternative splice sites within 
exons 1 1 and 12 of UNC5C. 
Patient Material 

All three patients from the family were carriers of the translocation. The family consisted 
of affected mother and two affected siblings, the father is normal build. Lymphoblastoid cell 
lines were established according to standard procedures. 
FISH Probes 

For yeast artificial chromosome (YAC) screening, YACs from chromosome 4 and 15 
from the CEPH human YAC library was used (YAC clones chromosome 4: 244C10, 676B9, 
767G2, 783H11, 840F2, 854F4, 858A2, 871A1, 881A12 and chromosome 15: 667B2, 
798C9, 801B4, 809G4, 827C7, 848E5, 854A11, 895G6, 897E9, 934G1, 937E4, 951F3). STS 
markers contained in the spanning YAC clones was used for PCR screening of bacterial 
artificial chromosome (BAC) libraries. 

The CEPH Human BAC library was screened using STS markers D4S2407 andD4S1559 
from YAC clone 858A2 on chromosome 4 and WI-7454 from YAC clone 798C9 on 
chromosome 15. STS marker AFMa244wc9 from chromosome 15 was used to screen BAC 
Human II library from Genome Systems, Inc. 

Restriction digestion with BamHI and HindBI was used to assemble a contig of the BAC 
clones and flanking clones from each contig were selected for BAC end walk. 
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B AC end sequencing was performed on whole B ACs purified with Qiagen Large Construct 
Purification kit (Qiagen) using sp6 and T7 primers. 

The end sequences of the B ACs were used to screen the nr (non-redundant) and htgs 
(high throughput genomic sequences) databases at the NCBI/NIH server using the BLASTN 
program. Positive B AC hits were ordered from BACPAC Resources (Children's Hospital 
Oakland) and used as FISH probes. 

To further map the breakpoints, sequences of approximately 6 kb from the spanning 
B ACs RP1 1-402L11 (AC093832) and RP1 1-90A19 (AC079068) was derived by long range 
PCR and used as probes for FISH 

Mini FISH probes used for chromosome 4: 



Forward primer 


Reverse primer 


SEO ED NO: 15 


SEQ ED NO : 16 


SEO ID NO: 17 


SEQ ED NO : 18 


SEO ID NO: 19 


SEQ ID NO: 20 


SEO ED NO: 21 


SEQ ID NO: 22 


SEO ED NO: 23 


SEQ ED NO: 24 


SEO ED NO: 25 


SEQ ID NO: 26 


Mini FISH probes used for chr< 


Forward primer 


Reverse primer 


SEO ED NO: 27 


SEQ ED NO: 28 


SEO ED NO: 29 


SEQ ID NO: 30 


SEO ED NO: 31 


SEQ ID NO: 32 


SEO ED NO: 33 


SEQ ID NO: 34 


SEO ED NO: 35 
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SEO ID NO: 37 


SEQ ID NO: 38 


SEO ID NO: 39 


SEQ ID NO: 40 


SEO ID NO: 41 


SEQ ID NO: 42 


SEQ ID NO: 43 


SEQ ID NO: 44 
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FISH analysis 

FISH was performed on metaphase chromosome spreads from the boy which is carrying 
the translocation. Metaphase spreads were prepared from lymhocytes according to standard 



methods. 
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Southern blot 

The probes used in southern blot were amplified by PCR from BAC clones RP11-402L11 
(chromosome 4) and RP1 1-90A19 (chromosome 15) and labelled by random priming. 
The probes used for chromosome 4 were: 



Forward primer 


Reverse primer 


SEO ID NO: 45 


SEO ID NO: 46 


SEO ID NO: 47 


SEO ID NO: 48 


SEO ID NO : 49 


SEO ID NO: 50 


SEO ID NO: 51 


SEO ID NO: 52 


SEO ID NO: 53 


SEO ID NO: 54 



The probes used for chromosome 15 were: 



Forward primer 


Reverse primer 


SEO ID NO: 55 


SEO ID NO: 56 


SEO ID NO: 57 


SEO ID NO: 58 


SEO ID NO: 59 


SEO ID NO: 60 


SEO ID NO: 61 


SEO ID NO: 62 



15 PCR and Sequencing of the Breakpoints 

The breakpoints could be bridged by long range PCR with chromosome derivate specific 
primers. For the chromosome 4 derivate the chromosome 4 primer SEQ ID NO: 63 and the 
chromosome 15 primer SEQ ID NO: 64 was used, and for the chromosome 15 derivate the 
chromosome 15 primer SEQ ID NO: 65 and the chromosome 4 primer SEQ ID NO: 66 was 

20 used. 
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The breakpoint on the chromosome 4 derivate PCR product could be sequenced using 
primer SEQ ID NO: 51 and the chromosome 15 derivate PCR product could be sequenced 
using primer SEQ ID NO: 52. 
RNA isolation 

Total RNA was isolated from cells using Trizole reagent (Gibco) according to the 
manufacturer's description. The isolated RNA was dissolved in DEPC treated water and 
concentration was determined by spectrophotometric measurement at 260nm. All samples 
were also electrophoresed in 1% agarose gel to assess integrity of ribosomal bands. 
DNAse treatment and cDNA synthesis 

15/ig of total RNA was treated with DNase to remove contaminating genomic DNA 
before cDNA synthesis. DNA-free™ from Ambion was used for this purpose. Shortly; 0.1 
volume of 10 times DNase buffer, 6 units of DNase and water up to 30/xl was added to each 
sample. Samples were incubated at 37°for 40 minutes. DNase inactivation reagent was added 
to each sample and after a 2 minutes incubation at room temperature it was removed by 
centrifugation. 

First strand cDNA synthesis was performed with SuperScript™First-Strand Synthesis 
System for RT-PCR (Invitrogen, life Technologies) according to enclosed protocol. 2/ig of 
DNase treated total RNA was used in each synthesis. Priming of synthesis was accomplished 
with 01igo(dT)i2-i8 .For each synthesis reaction a negative control with no enzyme was set up. 
PCR and cloning 

Full length cDNA of the fusion transcript was obtained with TaqPlus® Precision PCR 
System( Stratagene). In a total of 50/d, 1 time TaqPlus Precision buffer, 200fjM of each 
dNTP (Applied Biosystems), 300nM forward primer SEQ ID NO: 67, 300nM reverse primer 
SEQ ID NO: 68 and 2.5U TaqPlus Precision polymerase mixture. cDNA from above was 
used as template, 0.4/il/reaction (equivalent with 40ng total RNA). The reaction mixture was 
then subjected to thermal cycling in an PTC-200 from MJ research! Thermal cycling included 
one step initial denaturation one minute at 95°C, 35 cycles with 95°C one minute, 58°C one 
minute, 72°C three minutes and a final extension of ten minutes at 72°. The reaction mixture 
was then loaded on a 1% Low melt Agarose gel (BioRad). Inspection of the gel showed 2 
bands of approximately 2.8 and 3 kb. Both bands were cut out and ligated into pcR2.1- 
.TOPO® (Invitrogen) and sequentially transfected into TOP 10 (Invitrogen). 
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Both constructs were sequenced using Big Dye chemistry and 3100 Genetic 
Analyzer(Applied Biosystems). 
Quantification 

cDNA from above was also used for relative quantification on Applied Biosystems 7700. 

Quantitative real-time PCR was carried out using SYBR® Green PCR Master Mix in 
25/xL reactions run in triplicates. Template for the PCR was first strand cDNA from above 
(amount equivalent of lOng total RNA). Forward and reverse primers specific for UNC5c was 
used at a concentration of 400nM. As negative control for amplification of remaining' 
genomic DNA, no enzyme controls from cDNA synthesis was used. 
Shotgun library construction 

Human Bac clones where cultured over night in LB-media. DNA was prepared using 

Maxi-plasmid prep from Qiagen (VWR) . 

5fj.g purified DNA was fragmentet with a nebulizer. 

DNA was fragmented to 1 to 3 kb fragments, checked on 1% agarosegel. 

All according to the manufacturer's description in TOPO ® Shotgun Sub cloning Kit 
(Invitrogen). DNA was Blunt-end repaired, dephosphorylated and transformed into One Shot 
competent E.coli, also from the kit. 

The transformation reaction was plated on LB plates and incubated over night. 
A colony picker Q-PDI (Genetix) picked the colonies into 96-well micro plates with LB- 
media. 

One more incubation 37°C over night. 

Hydra, a dispenser moved the bakterial suspension to 96-deep well plates and a 34-hour 
37°C shake incubation was performed. After centrifugation, continued with DNApreparation. 

DNA preparation 

Add 50 til water, vortex 15 minutes. When complete resuspension add 70 y\ Stet- 
Tween20 with.5,2mg/ml Rnase and 8,4mg/ml Lysozyme. Denaturate in microwave 
1 minute add 335 pi water, vortex and let stand in ice for 10 minutes. Spin boxes for 30 
minutes x 4000rpm. The clear supernatante moves to new 96-well micro plates to be 
sequenced. 
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Sequencing 

Big Dye chemistry primer T3, primer T7 and ABI PRISM 3700 DNA Analyzer 
(Applied Biosystems) was used for sequencing clones. 
2x96 plate 

Big Dye mix 220/il (v.2.0) 

Water 220^1 

Buffer x 2,5 660fd ( 200mMTris-HCl ph 9.0, 5mM MgC12) 

Primer T7 or T3 ' 7ftl ( 100pmol//*l) 

Robot Biomek 2000; loaded the mix and the DNA samples into PCR-plates. 

PTC-200, (MJ Research) the thermal cycling instrument; 

95°G x 2', (95°C x 10", 50°C x 10", 60°C x 40 x 35 cycler and 8°C. 

Followed by centrifugation and ethanol cleaning up step. 

15/x.l water was added before running on 3700. . 

5 'RACE ofROR a. . 

Template used was Marathon Ready cDNA from Clontech (Human Adipocyte #7447(pooled 
from 11 male/female Caucasians, ages 19-57). 

Nested PCR was used to obtain discrete bands from the PCR reactions. In both reactions 
following concentrations was used; forward and revers primer 200nM, dNTP 200/iM, MgCl 
2mM, 0.125U/juL AmpliTaq Gold (Applied Biosystems) all in one time PCR buffer II 
(Applied Biosystems). 

In the first round, forward primer was API (linker primer, Clontech) (SEQ ID NO: 69). 
Reverse primer, situated in the RORalpha sequence (SEQ ID NO: 70). Template 2/xL 
(~0.1ng/jLtL, Marathon Ready cDNA). This reaction mixture was then subjected to thermal 
cycling in a PTC-200 from MJ research. Thermal cycling included one step initial 
denaturation of eight minutes at 95°C, 30 cycles with 94°C fortyfive seconds, 66°C fortyfive 
seconds, 72°C one minute and a final extension of ten minutes at 72°. 

In the second round of PCR, forward primer was AP2 (linker primer, Clontech) (SEQ ID NO: 
71). Reverse primer was the same as was used in the first round of PCR (SEQ 3D NO: 70). 
Temlate wasl/tL of a twenty times dilution of the first PCR-reaction. This mixture was 
subjected to another 30 cycles, as above, but with 64 instead of 66°C as annealing 
• temperature. 
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PCR reactions were loaded on a low melt agarose gel (1% from BioRad). Potentially 
interesting bands were chosen and cut out from the gel, slices melted at 65° and ligated into 
pcR2.1-TOPO® (Invitrogen). Ligation mixture was then used to transfect TOP 10 
(Invitrogen). 

Two clones from each band was then sequenced using Big Dye chemistry and 3100 
Genetic Analyzer(Applied Biosystems). 
Taqman analysis ofRORoS 

Primers where designed with Applied Biosystems program Primer Express 2.0 
Forward primer Tm 58,5°C was SEQ ID NO: 72. Reverse primer Tm 58,5°C was SEQ ID 
NO: 73. Templare poly A+RNA from the following human tissues was obtained from 
Clontech; Bone marrow, brain, kidney, liver, lung, skeletal muscle, small intestine, spleen, 
testis and fat. CDNA synthesis was made with Superscript™ First-Strand Synthesis System 
for RT-PCR, Invitrogen™ according to the protocol supplied by the manufaturer. 
Real time PCR was performed according to protol from the manufacturer and run on ABI 
Prism 7700® Sequence Detection System, Applied Biosystems. Results analysed with 
Sequence detector VI .7 

PCR detection of UNC5C splice variant transcript 

Forward primer was SEQ ID NO: 74 spanning bases 1586-1604 of human UNC5C ■ 
sequence AF055634. Reverse primer was SEQ ID NO 75 spanning 2386-2367 of transcript 
AF055634. Template was cDNA from human brain (mRNA from Clontech converted into 
cDNA using "Superscript™ First-Strand Synthesis System for RT-PCR" (Invitrogen)) 
according to protocol from the manufacturer. This PCR resulted in 3 bands. The expected one 
at ~800bp, a very faint one that was slightly shorter and one at ~550bp. The 800 and 550 
bands were cut out of the gel and purified using Qiaquick gel extraction kit (Qiagen) and 
then sequenced using Big Dye chemistry and 3100 Genetic Analyzer (Applied Biosystems). 
The 800 band was the expected UNC5c sequence ie partial UNC5c spanning bases 1586- 
2386. The 550bp fragment was UNC5C with a deletion of bases 1873-2078. This deletion is 
identical to the deletion in the sequence of UNC5Cb with accession number AAC90914. 
Bioinformatics 

Sequence assembly and alignment of gene sequences was made using Sequencer software 
from Gene codes corporation version 4.1.2 on a Macintosh G4. 
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Primary cell culture 

The following products were purchased from BioWhittaker (MD, USA): DMEM F-12, 
FBM (fibroblast basal medium), human fibroblast growth factor-fi (hFGF-P; 1 /ig/ml), 
Trypsin EDTA (200 mg/ml and 500 mg/ml), Fungizone (250jig/ml). Foetal calf serum (FCS) 
was provided from Harlan Sera lab (Leicestershire, UK). Pioglitasone (X-46) was from 
AstraZeneca R&D Molndal. 

0.008M Phosphate buffered saline (PBS); insulin and transferrin were obtained from Life 
Technologies. BSA fraction V, Collagenase (C-6885), IBMX (3-isobutyl-l-methylxanthine), 
dexamethasone, 0.05M Tris Buffered SaUne pH 8.0 were purchased from SIGMA. 0- 
mercaptoethanbl and glucose were from Kebo lab. Primaria™ tissue culture flasks were from 
Falcon. Nile Red was from Molecular Probes. 

Biopsies from humans were taken under aseptic conditions and transferred to 
DMEM/F12 supplemented with 25 jug/ml Gentamycin and Fungizone (2.5 fig/wl). The 
adipose tissue was digested in DMEM F-12 containing albumin (1 mg/ml), collagenase (1 
mg/ml), glucose (2 mg/ml), ! 25 Mg/ml Gentamycin and Fungizone (2.5 fig/xnl) in a shaking 
waterbath at 37°C for 30-45 min. The cell suspension was then filtered through a nylon mesh 
(lOO/im) and centrifuged at 650g for 10 minutes. The sedimented cells, preadipocytes, were 
resuspended and washed twice in DMEM F-12 supplemented with 25 fig/mi Gentamycin and 
Fungizone (2.5 Atg/ml). After the last wash the cells were resuspended in FBM (fibroblast 
medium) supplemented with 10% FCS, 25 /ig/ml Gentamycin and Fungizone (2.5 ng/ml, 
0.1% 0.1M mercaptoethanol, 10 mg/ml insulin, 10 mg/ml transferrin and 0.001 /ig/ml hFGF- 
P The cells was seeded and incubated in small Primaria culture flasks (25 cm 2 ) over night 
followed by washing with PBS and culture continued until the cells reached confluence. 
Thereafter the cells were washed in PBS (without Mg** and Ca**), treated with trypsin EDTA 
for a few minutes, resuspended and splited in fresh FBM and seeded into new small culture 
flasks. The medium was changed every day. The new cell lines were named PSc020108 
(male) and Asc020108 (female). 

The fraction with the mature adipocytes (after diggeration) was frozen in -80°C for further 

RNA preparation. 

Differentiation 

The undifferentiated preadipocytes were seeded into small cell culture flasks and cultured 
in DMEM/F12 supplemented with 10% FCS, 1% PEST, 10 /ig/ml insulin, 10 /ig/ml 
transferrin and 0.1 mM p-mercaptoethanol (referred to as standard medium). When the cells 
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have reached confluence the differentiation was induced by adding standard medium 
supplemented with 0.25/iM dexamethasone and 0.5 mM IBMX (referred to as induction 
medium 1) for three days. To maintain and extend the differentiation process the medium was 
changed to standard medium containing 10 fiM X-46 and 0.25^M Dexamethasone (referred 
to as induction medium 2) for 2 days. This process was repeated once and after that the 
culture process was continued only with standard medium supplemented with 0.25mM 
dexamethasone (referred to as maintenance mediuml). 
The lipid accumulation in the adipocytes was confirmed by Nile Red staining. 

Staining cells with Nile Red . 

Nile red is a benzophenoxazone dye. In older chemical and histochemical literature the 
dye sometimes referred to as nileblue A-oxazone. Nile red is poorly soluble in water but does 
dissolve in a wide variety of organic solvents. 

A lmg/ml stock solution of Nile red in acetone were prepared and stored protected from 
light. Staining could be carried out on either fixed (1.5% glutaraldehyde in PBS, for 5 min) or 
unfixed cells. Attached unfixed cultured cells were covered with PBS. The dye was then 
added directly to the flask to effect a 1:100 dilution to yield a final concentration of 10 /xg/ml. 
The preparation was incubated for a minimum of 5 min. For fluorescens microscopy excess 
dye was removed by briefly rinsing the cells with PBS. The suspension medium must not 
contain serum or albumin since they fcould act as a sink to draw Nile red out of the cells. Dye 
in the medium does not interfere with the observations since Nile red fluorescence is 
quenched in water. Nile red is not toxic for the cells so it is possible to continue the culture 
after staining when unfixed cells are used. 
Morphology and Fluorescence microscopy 

The differentiation of adipocyte precursors into mature adipocytes was assessed by . 
conventional microscopy at a 100-200-fold magnification. Cells were considered 
differentiated when their cytoplasm was filled with lipid droplets, clearly seen in phase 
contrast. The fluorescence microscopy studies was carried out using a Zeiss axiovert 25 
inverted microscope, equipped with a HBO 50 illuminator and with a CP-Achromat 10x/0.25 
phl/Varl objective lens and the excitation wavelength was 485 ran (filter set 487909). 
, The stained cells were photographed in color using the AxioVision system from Carl 

Zeiss. The pictures were processed in Adobe PhotoShop 6.0. 
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CLAIMS 

1. An isolated nucleic acid molecule comprising a nucleotide sequence having at least 65% 
identity to a degenerate variant of SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7. 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
comprises the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7. 

3. An isolated nucleic acid molecule comprising the complement of SEQ ID NO: 1, SEQ ID 
NO: 3 or SEQ ID NO: 7. 

4. An isolated nucleic acid molecule encoding a RORcd-UNC5C polypeptide, said nucleic 
acid molecule comprising a nucleotide sequence having at least 65% identity to a degenerate 
variant of SEQ ID NO: 1 or SEQ ID NO: 3. 

5. An isolated nucleic acid molecule encoding a RORa5 polypeptide, said nucleic acid 
molecule comprising.a nucleotide sequence having at least 65% identity to a degenerate 
variant of SEQ ID NO: 7. 

6. A vector comprising the nucleic acid molecule of any of claims 1-5. 

7. A host cell comprising the vector of claim 6. 

8. A purified fusion polypeptide of the RORal-UNC5C polypeptide comprising an amino 
acid sequence having at least 65% identity to the amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 4. 

9. The purified polypeptide of claim 8, wherein the amino acid sequence comprises the amino 
. acid sequence of SEQ ID NO: 2 or SEQ ID NO:4. 

10. A purified polypeptide of theRORa5 polypeptide comprising an amino acid sequence 
having at least 65% identity to the amino acid sequence of SEQ ID NO: 8. 
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11. The purified polypeptide of claim 10, wherein the amino acid sequence comprises the 
amino acid sequence of SEQ ID NO: 8. 

12. A method for producing a protein comprising: 

a) culturing the host cell of claim 7 under conditions whereby the protein is produced, 1 
and 

b) recovering the protein from the host cell culture. 

13. A method for detecting a polynucleotide which encodes a protein comprising the amino 
acid sequence of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO: 8 in a biological sample 
comprising the steps of: 

a) hybridizing a complement of the polynucleotide sequence which encodes SEQ ID NO: 
2, SEQ ID NO:4 or SEQ ID NO: 8 to a nucleic acid material of a biological sample, thereby 
forming a hybridization complex; and 

b) detecting the hybridization complex, wherein the presence of the complex correlates 
with the presence of a polynucleotide encoding a protein in the biological sample. 

14. A method for detecting the presence of an obesity susceptibility gene comprising: 

determining the identity of a gene located 20 cM, or less, adjacent to the 
translocation breakpoint between chromosome 4 at cytoband 4q22.3 and chromosome 15 at 
cytoband 15q22.2; and 

determining if the gene has aberrant gene expression as a consequence of the 

translocation event. 

15. A method for detecting the presence of a translocation junction between chromosome 4 at 
cytoband 4q22.3 and chromosome 15 at cytoband 15q22.2 comprising analysing a sample of 
DNA from an individual for the presence of the translocation junction. 

16. A method for identifying a test compound that modulates the expression of an obesity 
susceptibility gene identified in claim 14 comprising: 

contacting a cell capable of expressing the susceptibility gene with a test compound; and 
determining the level of expression of the obesity susceptibility gene in the presence of 
the test compound, wherein a decrease or an increase in expression of a obesity susceptibility 
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gene, as compared to the level of expression of an obesity susceptibility gene in the absence 
of the compound, is indicative that "the test compound modulates the expression of the 
obesity susceptibility gene. 

17. A method of identifying a test compound that modulates the activity of an obesity protein 
encoded by the obesity susceptibility gene identified in claim 14, comprising: 

contacting the obesity protein with a test compound; and 

determining the level of activity of the obesity protein in the presence of the compound, 
wherein a decrease or an increase in obesity protein activity, as compared to the level of 
activity of the obesity protein in the absence of the compound, is indicative that the test 
compound modulates obesity protein activity. 

18. A method of treating a subject having obesity comprising administering an effective 
amount of the compound identified in claims 16 or 17. 

19. Use of a compound identified in claims 16 or 17 in the preparation of a medicament for 
the treatment of obesity. 

20. A pharmaceutical composition comprising a compound identified in claims 17 or 17, and 
a pharmaceutical^ acceptable adjuvant, diluent or carrier. 

21. A method of making a pharmaceutical composition comprising: 

contacting a cell capable of expressing an obesity susceptibility gene with a test 
compound; 

determining the level of expression of the obesity susceptibility gene in the presence of 
the test compound, wherein a decrease in expression of a obesity susceptibility gene, as 
compared to the level of expression of a obesity susceptibility gene in the absence of the 
compound, is indicative that the test compound decreases obesity susceptibility gene 
expression; and 

formulating the test compound that decreases obesity susceptibility gene expression into 
pharmaceutical composition. 

22. A method of making a pharmaceutical composition comprising: 
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contacting an obesity protein with a test compound; 

determining the level of activity of the obesity susceptibility protein in the presence of 
the compound, wherein a decrease in obesity protein activity, as compared to the level of 
activity of the obesity protein in the absence of the compound, is indicative that the test 
compound decreases obesity protein activity; and 

formulating the test compound that decreases obesity protein, activity into a 
pharmaceutical composition. 

23. A method for determining if an obesity susceptibility gene identified in claim 14 has an 
altered level of gene expression comprising: 

comparing the level of obesity gene expression in a cell from a patient having obesity 

with a control cell, and 

determining the level of expression of the obesity susceptibility gene in both cells, 

wherein a decrease or an increase in expression of the obesity susceptibility gene, as 
compared to the level of expression of the obesity susceptibility gene in the control cell, 
indicates that the obesity susceptibility gene has altered gene expression. 

24. A method of diagnosing obesity, or a susceptibility thereto in a subject, the method 
comprising: 

determining the level of mRNA of TJNC5C; and 

comparing the level of mRNA of UNC5C in the sample with a control, wherein an 
increase in the level of UNC5C in the sample compared to the control indicates that the . 
. subject has obesity , or a susceptibility thereto. 

25. A method of diagnosing obesity or a susceptibility thereto in a subject, the method 
comprising: 

determining the level of aUNC5C protein in a sample from a subject; and 
comparing the level of UNC5C in the sample with a control, wherein an increase in the 
level of the protein in the sample compared to the control indicates that the subject has obesity 
» , or a susceptibility thereto. 
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26. A method of diagnosing obesity, or a susceptibility thereto in a subject, the method 
comprising analysing for the presence of RORlcd-UNC5C mRNA, wherein the presence of 
the mRNA indicates that the subject has obesity , or a susceptibility thereto. 

27. A method of diagnosing obesity or a susceptibility thereto in a subject, the method 
comprising analysing for the presence of the RORlcd-UNC5C fusion polypeptide, wherein 
the presence of the polypeptice indicates that the subject has obesity, or a susceptibility 
thereto. 
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ABSTRACT 
TITLE; MOLECULES 

The present invention describes the chromosomal location of obesity susceptibility genes 
and provides methods for detecting the presence of such genes. The present invention enables 
the development of novel therapies to treat obesity. 
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SEQUENCE LISTING 

<110> AstraZeneca AB 

5 <120> Molecules 

<130> JAN/100852 FEB 03 

<140> ■ 
10 <141> 



15 



20 



<160> 75 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 2986 
<212> DNA 

<213> Homo sapiens 



<400> 1 

ctgaaaacag aagatagaga ggagtctcgg agctcgccat ctccagcgat ctctacattg 60 
ggaaaaaaca tggagtcagc tccggbagcc cccgaccccg ccgccagcga gccaggcagc 120 
agcggcgcgg acgcggccgc cggctccagg gagaccccgc tgaaccagga atccgcccgc 180 
25 aagagcgagc cgcctgcccc ggtgcgcaga cagagctatt ccagcaccag cagagatgat 240 
gacttttttc atgaactccc agaaactttt ccttctgatc cacctgagcc tctgccacat 300 
ttccttattg agcctgaaga agcttatatt gtgaagaata agcctgtgaa cctgtactgt 360 
aaagcaagcc ctgccaccca gatctatttc aagtgtaata gtgaatgggt tcatcagaag 420 
gaccacatag tagatgaaag agtagatgaa acttccggtc tcattgtccg ggaagtgagc 480 

30 attgagattt cgcgccagca agtggaagaa ctctttggac ctgaagatta ctggtgccag 540 
tgtgtggcct ggagctccgc gggtaccaca aagagccgga aggcgtatgt gcgcattgca 600 
tatctacgga agacatttga gcaggaaccc ctaggaaagg aagtgtcttt ggaacaggaa 660 
gtcttactcc agtgtcgacc acctgaaggg atcccagtgg ctgaggtgga atggttgaaa 720 
aatgaagaca taattgatcc cgttgaagat cggaattttt atattactat tgatcacaac 780 

35 ctcatcataa agcaggcccg actctcfcgat actgcaaatt acacctgtgt tgccaaaaac 840 
attgttgcca agaggaaaag tacaactgcc actgtcatag tctatgtcaa cggtggctgg 900 
tccacctgga cggagtggtc tgtgtgtaac agccgctgtg gacgagggta tcagaaacgt 960 
acaaggactt gtaccaaccc ggcaccactc. aatgggggtg ccttctgtga agggcagagt 1020 
gtgcagaaaa tagcctgtac tacgttatgc ccagtggatg gcaggtggac gccatggagc 1080 

40 aagtggtcta cttgtggaac tgagtgcacc cactggcgca ggagggagtg cacggcgcca 1140 
gcccccaaga atggaggcaa ggactgcgac ggcctcgtct tgcaatccaa gaactgcact 1200 
gatgggcttt gcatgcagag tttcatttat cccatttcaa ctgaacagag aacccagaat 1260 
gaatatggat tttcttctgc tcctgattca gatgatgttg ctctctatgt tgggattgtg 1320 
atagcagtga tcgtttgcct ggcgatctct gtagttgtgg ccttgtttgt gtatcggaag 1380 

45 aatcatcgtg actttgagtc agatattatt gactcttcgg cactcaatgg gggctttcag 1440 
cctgtgaaca tcaaggcagc aagacaagat ctgctggctg taqccccaga cctcacgtca ■ 1500 
gctgcagcca tgtacagagg acctgtctat gccctgcatg acgtctcaga caaaatccca 156 0 
atgaccaact ctccaattct ggatccactg cccaacctga aaatcaaagt gtacaacacc 1620 
tcaggtgctg tcacccccca agatgacctc tctgagttta cgtccaagct gtcccctcag 1680 

50 atgacccagt cgttgttgga gaatgaagcc ctcagcctga agaaccagag tctagcaagg 1740 
cagactgatc catcctgtac* cgcatttggc agcttcaact cactgggagg tcaccttatt 1800 
gttcccaatt caggagtcag cttgctgatt cccgctgggg ccattcccca agggagagtc 1860 
tacgaaatgt atgtgactgt acacaggaaa gaaactatga ggccacccat ggatgactct 1920 
cagacacttt tgacccctgt ggtgagctgt gggcccccag gagctctgct cacccgcccc 1980 

55 gtcgtcctca ctatgcatca ctgcgcagac cccaataccg aggactggaa aatactgctc 2040 
ctcgctggag tacagcatcc gagtctactg tctggatgac acccaggatg ccctgaagga 2100 
aattttacat cttgagagac agacgggagg acagctccta gaagaaccta aggctcttca 2160 
ttttaaaggc agcacccaca acctgcgcct gtcaattcac gatatcgccc attccctctg 2220 
gaagagcaaa ttgctggcta aatatcagga aattccattt taccatgttt ggagtggatc 2280 

60 tcaaagaaac ctgcactgca ccttcactct ggaaagattt agcctgaaca cagtggagct 2340 
ggtttgcaaa ctctgtgtgc ggcaggtgga aggagaaggg . cagatcttcc agctcaactg 2400 
caccgtgtca gaggaaccta ctggcatcga tttgccgctg ctggatcctg cgaacaccat 2460 
caccacggtc acggggccca gtgctttcag catccctctc cctatccggc agaagctctg 2520 
tagcagcctg gatgcccccc agacgagagg ccatgactgg aggatgctgg cccataagct 2580 

65 gaacctggac aggtacttga attactttgc caccaaatcc agcccaactg gcgtaatcct 2640 
ggatctttgg gaagcacaga acttcccaga tggaaacctg agcatgctgg cagctgtctt '2700 
ggaagaaatg ggaagacatg aaacggtggt gtccttagca gcagaagggc agtattaacc 2760 
accatgctgg aaggggaaat 2780 
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<210> 2 

<211> 964 

5 <212> PRT 

<213> Homo sapiens 

<400> 2 

Met Glu Ser Ala Pro Ala Ala Pro Asp Pro Ala Ala Ser Glu Pro Gly 
10 1 5 10 15 

Ser Ser Gly Ala Asp Ala Ala Ala Gly. Ser Arg Glu Thr Pro Leu Asn 
20 25 30 

15 Gin Glu Ser Ala Arg Lys Ser Glu Pro Pro Ala Pro Val Arg Arg Gin 
■ 35 ~ 40 . 45 • 

Ser Tyr Ser Ser Thr Ser Arg Asp Asp Asp Phe Phe His Glu Leu Pro 
50 55 60 

20 

Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe Leu He 
65 70 75 80 

Glu Pro Glu Glu Ala Tyr He Val Lys Asn Lys Pro Val Asn Leu Tyr 
25 85 90 95 

Cys Lys Ala Ser Pro Ala Thr Gin He Tyr Phe Lys Cys Asn Ser Glu 
100 105 HO 

.30 Trp Val His Gin Lys Asp His lie Val Asp Glu Arg Val Asp Glu Thr 
115 120 125 

Ser Gly Leu He Val Arg Glu VaJ. Ser He Glu He Ser Arg Gin Gin 
130 135- 140 

35 

Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys Val Ala 
145 150 155 160 

Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val Arg He 
40 ~ 165 170 175 

Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys Glu Val 
180 185 - 190 

45 Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu Gly He 
195 200 205 

Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp He He Asp Pro 
210 215 220 

50 

Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu He He 
225 " ^ 230 235 * 240 

Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val Ala Lys 
55 245 2$0 255 

Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He Val Tyr 
260 265 270 

60 Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys Asn Ser 
275 280 285 

Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr Asn Pro 
290 " 295 300 

65 

Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val Gin Lys 
305 310 315 320 
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Ile Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr Pro Trp 
325 330 335 

Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg Arg Arg 
5 340 345 350 

Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys Asp Gly 
355 360 365 

10 Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met Gin Ser 
370 375 380 

Phe He Tyr Pro He Ser Thr Glu Gin Arg Thr Gin Asn Glu Tyr Gly 
385 390 395 400 

15 

Phe' Ser Ser Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He 
405 410 415 

Val He Ala Val ' He Val Cys Leu Ala He Ser Val Val Val Ala Leu 
20 420 425 430 

Phe Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp 
435 440 445 

25 Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala 
450 455 460 

Arg Gin Asp. Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala 
465 470 475 480 

30 

Met Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He 
485 - 490 495 

Pro Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys He 
35 500 505 510 

Lys Val Tyr Asn Thr Ser Gly Ala Val Thr Pro Gin Asp Asp Leu Ser 
515 520 525 

40 Glu Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Sex Leu Leu Glu 
530 535 540 

Asn Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp 
545 550 555 560 

45 

Pro Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu 
565 570 575 

lie Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Qly Ala He 
50 580 585 590 

Pro Gin Gly Arg Val tyr Glu Met Tyr Val Thr Val His Arg Lys Glu 
595 600 605 

55 Thr Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val 
610 615 620 

Val Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu 
625 * 630 635 640 

60 

Thr Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu 
645 650 655 

Leu Lys Asn Gin Ala Ala Gin Gly Gin Trp Glu Asp Val Val Val Val 
65 ' 660 665 670 

Gly Glu Glu Asn Phe Thr Thr Pro Cys Tyr He Gin Leu Asp Ala Glu 
675 680 685 
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Ala Cys His He Leu Thr Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly 
690 695 700 

5 His Ser Thr Thr Lys Ala Ala Ala Lys Arg Leu Lys Leu Ala He Phe 
705 710 715 720 

Gly Pro Leu Cys Cys Ser Ser Leu Glu Tyr Ser He Arg Val Tyr Cys 
725 730 735 

10 

Leu Asp Asp Thr Gin Asp Ala Leu Lys Glu He Leu His Leu Glu Arg 
740 745 '750 

Gin Thr Gly Gly Gin Leu Leu Glu Glu Pro Lys Ala Leu His Phe Lys 
15 755 760 765 

Gly Ser Thr His Asn Leu Arg Leu Ser He His Asp He Ala His Ser 
770 775 780 

20 Leu Trp Lys Ser Lys Leu Leu Ala Lys Tyr Gin Glu He Pro Phe Tyr 
785 790 . 795 800 

His Val Trp Ser Gly Ser Gin Arg Asn Leu His Cys Thr Phe Thr Leu 
805 810 ■ 815 

25 

Glu Arg Phe Ser Leu Asn Thr Val Glu Leu Val Cys Lys Leu Cys Val 
820 825 830 

Arg Gin Val Glu Gly Glu Gly Gin He Phe Gin Leu Asn Cys Thr Val 
30 835 840 845 

Ser Glu Glu Pro Thr Gly He Asp Leii Pro Leu Leu Asp Pro Ala Asn 
850" 855 860 

35 Thr He Thr Thr Val Thr Gly Pro Ser Ala Phe Ser He Pro Leu Pro 
865 870 875 880 

.He Arg Gin Lys Leu Cys Ser Ser Leu Asp Ala .Pro Gin Thr Arg Gly 
885 890 895 

40 

His Asp Trp Arg Met Leu Ala His Lys Leu Asn Leu Asp Arg Tyr Leu 
900 905 910 

Asn Tyr Phe Ala Thr Lys Ser Ser Pro Thr Gly Val He Leu Asp Leu 
45 915 920 925 

Trp Glu Ala Gin Asn Phe Pro Asp Gly Asn Leu Ser Met Leu Ala Ala 
930 935 • 940 

50 Val Leu Glu Glu Met Gly Arg His Glu Thr Val Val Ser Leu Ala Ala 
945 950 955 960 

Glu Gly Gin Tyr 

55 " r 

<210> 3 
<211> 2780 
<212> DNA . 
60 <213> Homo sapiens 



<400> 3 

ctgaaaacag aagatagaga ggagtctcgg agctcgccat ctccagcgat ctctacattg 60 

ggaaaaaaca tggagtcagc tccggcagcc cccgaccccg ccgccagcga gccaggcagc .120 
65 agcggcgcgg acgcggccgc cggctccagg gagaccccgc tgaaccagga atccgcccgc- 180 

aagagcgagc cgcctgcccc ggtgcgcaga cagagctatt ccagcaccag cagagatgat 240 

gacttttttc atgaactccc agaaactttt ccttctgatc cacctgagcc tctgccacat 300 

ttccttattg agcctgaaga agcttatatt gtgaagaata agcctgtgaa cctgtactgt 360 
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aaagcaagcc 
gaccacatag 
attgagattt 
tgtgtggcct 
5 tatcrtacgga 
gtcttactcc 
aatgaagaca 
ctcatcataa 
attgttgcca 

10 tccacctgga 
acaaggactt 
gtgcagaaaa 
aagtggtcta 
gcccccaaga 

15 gatgggcttt 
gaatatggat 
atagcagtga 
aatcatcgtg 
cctgtgaaca 

20 gctgcagcca 
atgaccaact 
tcaggtgctg 
atgacccagt 
cagactgatc 

25 gttcccaatt 
tacgaaatgt 
cagacacttt 
gtcgtcctca 
ctcgctggag 

30 aattttacat 
ttttaaaggc 
gaagagcaaa 
tcaaagaaac 
ggtttgcaaa 

35 caccgtgtca 
caccacggtc 
tagcagcctg 
gaacctggac 
ggatctttgg 

40 ggaagaaatg 
accatgctgg 



ctgccaccca 
tagatgaaag 
cgcgccagca 
ggagctccgc 
agacatttga 
agtgtcgacc 
taattgatcc 
agcaggcccg 
agaggaaaag 
cggagtggtc 
gtaccaaccc 
tagcctgtac 
cttgtggaac 
atggaggcaa 
gcatgcagag 
tttcttctgc 
tcgtttgcct 
actttgagtc 
tcaaggcagc 
tgfcacagagg 
ctccaattct 
tcacccccca 
cgttgttgga 
catcctgtac 
caggagtcag 
atgtgactgt 
tgacccctgt 
ctatgcatca 
tacagcatcc 
cttgagagac 
agcacccaca 
ttgctggcta 
ctgcactgca 
ctctgtgtgc 
gaggaaccta 
acggggccca 
gatgcccccc 
aggtacttga 
gaagcacaga 
ggaagacatg 
aaggggaaat 



gatctatttc 
agtagatgaa 
agtggaagaa 
gggtaccaca 
gcaggaaccc 
acctgaaggg 
cgttgaagat 
actctctgat 
tacaactgcc 
tgtgtgtaac 
ggcaccactc 
tacgttatgc 
tgagtgcacc 
ggactgcgac 
tttcatttat 
tcctgattca 
ggcgatctct 
agatattatt 
aagacaagat 
acctgtctat 
ggatccactg 
agatgacctc 
gaatgaagcc 
cgcatttggc 
cttgctgatt 
acacaggaaa 

ggtgagctgt 

ctgcgcagac 
gagtctactg 
agacgggagg 
acctgcgcct 
aatatcagga 
ccttcactct 
ggcaggtgga 
ctggcatcga 
gtgctttcag 
agacgagagg 
attactttgc 
acttcccaga 
aaacggtggt 



aagtgtaata 
acttccggtc 
ctctttggac 
aagagccgga 
ctaggaaagg 
atcccagtgg 
cggaattttt 
actgcaaatt 
actgtcatag 
agccgctgtg 
aatgggggtg 
ccagtggatg 
cactggcgca 
ggcctcgtct 
cccatttcaa 
gatgatgttg 
gtagttgtgg 
gactcttcgg 
ctgctggctg 
gccctgcatg 
cccaacctga 
tctgagttta 
ctcagcctga 
agcttcaact 
cccgctgggg 
gaaactatga 
gggcccccag 
cccaataccg 
tctggatgac 
acagctccta 
gtcaattcac 
aattccattt 
ggaaagattt 
aggagaaggg 
tttgccgctg 
catccctctc 
ccatgactgg 
caccaaatcc 
tggaaacctg 
gtccttagca 



gtgaatgggt 
tcattgtccg 
ctgaagatta 
aggcgtatgt 
aagtgtcttt 
ctgaggtgga 
atattactat 
acacctgtgt 
tctatgtcaa 
gacgagggta 
ccttctgtga 
gcaggtggac 
ggagggagtg 
tgcaatccaa 
ctgaacagag 
ctctctatgt 
^cttgtttgt 
cactcaatgg 
tacccccaga 
acgtctcaga 
aaatcaaagt 
cgtccaagct 
agaaccagag 
cactgggagg 
ccattcccca 
ggccacccat 
gagctctgct 
aggactggaa 
acccaggatg 
gaagaaccta 
gatatcgccc 
taccatgttt 
agcctgaaca 
cagatcttcc 
ctggatcctg 
cctatccggc 
aggatgctgg 
agcccaactg 
agcatgctgg 
gcagaagggc 



tcatcagaag 
ggaagtgagc 
ctggtgccag 
gcgcattgca 
ggaacaggaa 
atggttgaaa 
tgatcacaac 
tgccaaaaac 
cggtggctgg 
t'cagaaacgt 
agggcagagt 
gccatggagc 
cacggcgcca 
gaactgcact 
aacccagaat 
tgggattgtg 
gtatcggaag 
gggctttcag 
cctcacgtca 
caaaatccca 
gtacaacacc 
gtcccctcag 
totagcaagg 
tcaccttatt 
agggagagtc 
ggatgactct 
cacccgcccc 
aatactgctc 
ccctgaagga 
aggctcttca 
attccctctg 
ggagtggatc 
cagtggagct 
agctcaactg 
cgaacaccat 
agaagctctg 
cccataagct 
gcgtaatcct 
cagctgtctt 
agtattaacc 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2S20 

2580 

2640 

2700 

2760 

2780 



<210> 4 

45 <211> 669 

<212> PRT . 

<213> Homo sapiens 

<400> 4 

50 Met Glu Ser Ala Pro Ala Ala Pro Asp Pro Ala Ala Ser Glu Pro Gly 
15 10 15 



55 



60 



Ser Ser Gly Ala Asp Ala Ala Ala Gly Ser Arg Glu Thr Pro Leu Asn 
20 25 ' 30 

Gin Glu Ser Ala Arg Lys Ser Glu Pro Pro Ala Pro Val Arg Arg Gin 
35 40 45 

Ser Tyr Ser Ser Thr Ser Arg Asp Asp Asp Phe Phe.His Glu Leu Pro 
50 55 60 

Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe Leu lie 
65 70 75 80 



65 Glu Pro Glu Glu Ala' Tyr lie Val Lys Asn Lys Pro Val Asn Leu Tyr 

85 90 95 



Cys Lys Ala Ser Pro Ala Thr Gin lie Tyr Phe Lys Cys Asn Ser Glu 
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100 105 HO 

Trp val His Gin Lys Asp His He Val Asp Glu Arg Val Asp Glu Thr 
115 120 125 

5 ser Gly Leu He Val Arg Glu Val Ser He Glu He Ser Arg Gin Gin 
130 135 140 

Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys Val Ala 
i0 145 150 155 160 

Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val Arg He 
165 170 175 

15 Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys Glu Val 
180 185 . 190 

Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu Gly He 
195 200 205 

20 

Pro Val Ala Glu Val Glu -Trp Leu Lys Asn Glu Asp He He Asp Pro 
210 215 220 

Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu He He 
25 225 * ~ 230 235 240 

Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val Ala Lys 
245 250 255 

30 Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He Val Tyr 
260 265 270 

Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys Asn Ser . 
275 280 285 

35 

Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr Asn Pro 
290 295 300 

Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val Gin Lys 
40 305 310 315 320 

He Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr Pro Trp 
325 330 335 

45 Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg Arg Arg 
340 345 350 . 

Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys Asp Gly 
355 360 365 

50 

Leu Val Leu Glh Ser Lys Asn Cys Thr Asp Gly Leu Cys Met Gin Ser 
370 . 375 380 

Phe He Tyr Pro He Ser Thr Glu Gin Arg Thr Gin Asn Glu Tyr Gly 
55 385 " 390 395 400 

Phe Ser Ser Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He 
405 410 415 

60 Val He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu 
420 425 430 

Phe Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp 
435 ' 440 445 

65 

Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala 
450 455 460 
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Arg Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala 
465 ~ 470 475 480 

Met Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He 
5 485 490 495 

Pro Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys He 
500 505' 510 

10 Lys Val Tyr Asn Thr Ser Gly Ala Val Thr Pro Gin Asp Asp Leu Ser 
515 520 525 



15 



Glu Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu 
530 535 540 

Asn Glu Ala Leu Ser Leu , Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp 
545 550 555 560 



Pro Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu 
20 565 570 575 

He Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala He 
580 585 590 

25 Pro Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu 
595 600 605 



30 



Thr Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val 
610 615 620 

Val Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu 
625 * 630 635 640 



Thr Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu 
35 645 650 655 



40 



45 



Leu Leu Ala Gly Val Gin His Pro Ser Leu Leu Ser Gly 
660 665 



<210> 5 

<211> 1863 

<212> DNA 

<213> Homo sapiens 



<400> 5 

gttttttttt tttttttggt accatagagt tgctctgaaa acagaagata gagggagtct 6 0 
cggagctcgc atctccagcg atctctacat tgggaaaaaa catggagtca gctccggcag 120 
cccccgaccc cgccgccagc gagccaggca gcagcggcgc ggacgcggcc gccggctcca 180 

50 gggagacccc gctgaaccag gaatccgccc gcaagagcga gccgcctgcc ccggtgcgca 240 
gacagagcta ttccagcacc agcagaggta tctcagtaac gaagaagaca catacatctc 300 
aaattgaaat tattccatgc aagatctgtg gagacaaatc atcaggaatc cattatggtg 3 60 
tcattacatg tgaaggctgc aagggctttt tcaggagaag tcagcaaagc aatgccacct 420 
actcctgtcc tcgtcagaag aactgtttga ttgatcgaac cagtagaaac cgctgccaac 4 80 

55 - actgtcgatt acagaaatgc cttgccgtag ggatgtctcg agatgctgta aaatttggcc 540 
gaatgtcaaa aaagcagaga gacagcttgt atgcagaagt acagaaacac cggatgcagc 600 
agcagcagcg cgaccaccag cagcagcctg gagaggctga gcpgctgacg cccacctaca 660 
acatctcggc caacgggctg acggaacttc acgacgacct cagtaactac attgacgggc 720 
acacccctga ggggagtaag gcagactccg ccgtcagcag cttctacctg- gacatacagc 780 

60 cttccccaga ccagtcaggt cttgatatca atggaatcaa accagaacca atatgtgact 840 
acacaccagc atcaggcttc tttccctact gttcgttcac caacggcgag acttccccaa 900 
ctgtgtccat ggcagaatta gaacaccttg cacagaatat atctaaatcg catctggaaa 960 
cctgccaata cttgagagaa gagctccagc agataacgtg gcagaccttt ttacaggaag 1020 
aaattgagaa ctatcaaaac aagcagcggg aggtgatgtg gcaattgtgt gccatcaaaa 1080 

65 ttacagaagc tatacagtat gtggtggagt ttgccaaacg cattgatgga tttatggaac 1140 
tgtgtcaaaa tgatcaaatt gtgcttctaa aagcaggttc tctagaggtg gtgtttatca 1200 
gaatgtgccg tgcctttgac tctcagaaca acaccgtgta ctttgatggg aagtatgcca 1260 
gccccgacgt cttcaaatcc ttaggttgtg aagaetttat tagctttgtg tttgaatttg 1320 
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gaaagagttt atgttctatg cacctgactg aagatgaaat tgcattattt tctgcatttg 13 80 
tactgatgtc agcagatcgc tcatggctgc aagaaaaggt aaaaattgaa aaactgcaac 1440 
agaaaattca gctagctctt caacacgtcc tacagaagaa tcaccgagaa gatggaatac 1500 
taacaaagtt aatatgcaag gtgtctacat taagagcctt atgtggacga catacagaaa 1560 
5 agctaatggc atttaaagca atatacccag acattgtgcg acttcatttt cctccattat 1620 
acaaggagtt gttcacttca gaatttgagc cagcaatgca aattgatggg taaatgttat 1680 
cacctaagca cttctagaat gtctgaagta caaacatgaa aaacaaacaa aaaaattaac 1740 
cgagacactt tatatggccc tgcacagacc tggagcgcca cacactgcac atcttttggt 1800 
gatcggggtc aggcaaagga ggggaaacaa tgaaaacaaa taaagttgaa cttgtttttc 1860 
10 tea * . 1863 

<210> 6 
<211> 523 
15 <212> PRT 

<213> Homo sapiens 

<400> 6 

Met Glu Ser Ala Pro Ala Ala Pro Asp Pro Ala Ala Ser Glu Pro Gly 
20 1 5 10 15 

Ser" Ser Gly Ala Asp Ala Ala Ala Gly Ser Arg Glu Thr Pro Leu Asn 
20 25 30 

25 Gin Glu Ser Ala Arg Uys Ser Glu Pro Pro Ala Pro Val Arg Arg Gin 
35 40 45 

Ser Tyr Ser Ser Thr' Ser Arg Gly He Ser Val Thr Lys Lys Thr His 
50 -55 60 

30 

Thr Ser Gin He Glu He He Pro Cys Lys He Cys Gly Asp Lys Ser 
65 70 75 80 

Ser Gly He His Tyr Gly Val He Thr Cys Glu Gly Cys Lys Gly Phe 
35 85 90 95 

Phe Arg Arg Ser Gin Gin Ser Asn Ala Thr Tyr Ser Cys Pro Arg Gin 
100 105 110 

40 Lys Asn Cys Leu He Asp Arg Thr Ser Arg Asn Arg Cys Gin His Cys 
115 120 125 

Arg Leu Gin Lys Cys Leu Ala Val Gly Met Ser Arg Asp Ala Val Lys 
130 135 140 

45 

Phe Gly Arg Met Ser Lys Lys Gin Arg Asp Ser Leu Tyr Ala Glu Val 
145 150- 155 160 

Gin Lys His Arg Met Gin Gin Gin Gin Arg Asp His Gin Gin Gin Pro 
50 165 ' 170 175 

Gly Glu Ala Glu Pro Leu Thr Pro Thr Tyr Asn He Ser Ala Asn Gly 
180 185 190 

55 Leu Thr Glu Leu His Asp Asp Leu Ser Asn Tyr He Asp Gly His Thr 
195 200 205 

Pro Glu Gly Ser Lys Ala Asp Ser Ala Val Ser Ser Phe Tyr Leu Asp 
210 215 220 

60 

He Gin Pro Ser Pro Asp Gin Ser Gly Leu Asp He Asn Gly He Lys 
225 230 235 240 

Pro Glu Pro He Cys Asp Tyr Thr Pro Ala Ser Gly Phe Phe Pro Tyr 
65 245 * 250 255 

Cys Ser Phe Thr Asn Gly Glu Thr Ser Pro Thr Val Ser Met Ala Glu 
260 265 270 
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Leu Glu His Leu Ala Gin Asn He Ser Lys Ser His Leu Glu Thr Cys 
275 280 285 

5 Gin Tyr Leu Arg Glu Glu Leu Gin Gin He Thr Trp Gin Thr Phe Leu 
290 295 300 

Gin Glu Glu He Glu Asn Ty* Gin Asn Lys Gin Arg Glu Val Met Trp 
305 310 315 320 

10 • 

Gin Leu Cys Ala He Lys He Thr Glu Ala He Gin Tyr Val Val Glu 
325 330 335 

Phe Ala Lys Arg He Asp Gly Phe Met Glu Leu Cys Gin As A Asp Gin 
15 340 345 . 350 

lie Val Leu Leu Lys Ala Gly Ser Leu Glu Val Val Phe He Arg Met 
355 ~ 360 365 

20 Cys Arg Ala Phe Asp Ser Gin Asn Asn Thr Val Tyr Phe Asp Gly Lys 
370 375 380 



25 



Tyr Ala Ser Pro Asp Val Phe Lys Ser Leu Gly Cys Glu Asp Phe He 
385 390 395 400 

Ser Phe Val Phe Glu Phe Gly Lys Ser Leu Cys Ser Met His Leu Thr 
405 410 415 



Glu Asp Glu He Ala Leu Phe Ser Ala Phe Val Leu Met Ser Ala Asp 
30 420 425 430 

Arg Ser Trp Leu Gin Glu Lys Val Lys He Glu Lys Leu Glri Gin Lys 

435 440 445 

35 He Gin Leu Ala Leu Gin His Val Leu Gin Lys Asn His Arg Glu Asp 
450 4$5 460 



40 



Gly He Leu Thr Lys Leu He Cys Lye Val Ser Thr Leu Arg Ala Leu 

465 470 . 475 480 

Cys Gly Arg His Thr Glu Lys Leu Met Ala Phe Lys Ala He Tyr Pro 

485 490 495 



Asp He Val Arg Leu His Phe Pro Pro Leu Tyr Lys Glu Leu Phe Thr 
45 500 505 510 



50 



55 



Ser Glu Phe Glu Pro Ala Met Gin He Asp Gly 
515 520 



<210> 7 

<211> 1821 • 

<212> DNA 

<213> Homo sapiens 



<400> 7 

ctagattcct ctgtgtttca tacccaggag gtcagggagg gcaccttgta gtgcctgagg 60 
ccccgagagg gttcacagcc acctgagaag tgtttgcagt ctatgggtgt gtgttctttc 120 
tccaagctgt gtcagctgtg ggatggcgtg gagagaaaac aagaaaaagt tgtttccctc 180 

60 agctctctgc tgaaaaaatg ctcacgggga gtcagtcaac atttggtatc tcagtaacga 240 
agaagacaca tacatctcaa attgaaatta ttccatgcaa gatctgtgga gacaaatcat 300 
caggaatcca ttatggtgtc attacatgtg aaggctgcaa gggctttttc aggagaagtc 3 60 
agcaaagcaa tgccacctac tcctgtcctc gtcagaagaa ctgtttgatt gatcgaacca 420 
gtagaaaccg ctgccaacac tgtcgattac agaaatgcct tgcegtaggg atgtctcgag 480 

65 atgctgtaaa atttggccga atgtcaaaaa agcagagaga cagcttgtat gcagaagtac 540 
agaaacaccg gatgcagcag cagcagcgcg accaccagca gcagcctgga gaggctgagc 600 
cgctgacgcc cacctacaac atctcggcca acgggctgac ggaacttcac gacgacctca 660 
gtaactacat tgacgggcac acccctgagg ggagtaaggc agactccgcc gtcagcagct 720 
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tctacctgga-catacagcct tccccagacc agtcaggtct tgatatcaat ggaatcaaac 780 
cagaaccaat atgtgactac acaccagcat caggcttctt tccctactgt tcgttcacca 840 
acggcgagac ttccccaact gtgtccatgg "cagaattaga acaccttgca cagaatatat 900 
ctaaatcgca tctggaaacc tgccaatact tgagagaaga gctccagcag ataacgtggc 960 
5 agaccttttt acaggaagaa attgagaact- atcaaaacaa gcagcgggag gtgatgtggc 1020 
aattgtgtgc catcaaaatt acagaagcta tacagtatgt ggtggagttt gccaaacgca 1080 
ttgatggatt tatggaactg tgtcaaaatg atcaaattgt gcttctaaaa gcaggttctc 1140 
tagaggtggt gtttatcaga atgtgccgtg cctttgactc tcagaacaac accgtgtact 1200 
tcgatgggaa gtatgccagc cccgacgtct tcaaatcctt aggttgtgaa gactttatta 1260 

10 gctttgtgtt tgaatttgga aagagtttat gttctatgca cctgactgaa gatgaaattg 1320 
cattattttc tgcatttgta ctgatgtcag cagatcgctc atggctgcaa gaaaaggtaa 1380 
aaattgaaaa actgcaacag aaaattcagc tagctcttca acacgtccta cagaagaatc 1440 
accgagaaga tggaatacta acaaagttaa tatgcaaggt gtctacatta agagccttat 1500 
gtggacgaca tacagaaaag ctaatggcat ttaaagcaat atacccagac attgtgcgac 1560 

15 ttcattttcc tccattatac aaggagttgt tcacttcaga atttgagcca gcaatgcaaa 1620 
ttgatgggta aatgttatca cctaagcact tctagaatg^ ctgaagtaca aacatgaaaa 1680 
acaaacaaaa aaattaaccg agacacttta tatggccctg cacagacctg ' gagcgccaca 1740 
cactgcacat cttttggtga tcggggtcag gcaaaggagg ggaaacaatg aaaacaaata 1800 
aagttgaact "tgtttttctc a N 1821 

20 

<210> 8 
<211> 477 
<212> PRT 
25 <213> Homo sapiens 



30 



<400> 8 

Met Leu Thr Gly Ser Gin Ser Thr Phe Gly lie Ser Val Thr Lys Lys 
1 5 10 , 15 

Thr His Thr Ser Gin lie Glu He He Pro Cys Lys lie Cys Gly Asp 
20 25 30 



Lys Ser Ser Gly He His Tyr Gly Val He Thr Cys Glu Gly Cys Lys 

35 35 40 45 

Gly Phe Phe Arg Arg Ser Gin Gin Ser Asn Ala Thr Tyr Ser Cys Pro 

50 55 60 

40 Arg Gin Lys Asn Cys Leu He Asp Arg Thr Ser Arg Asn Arg Cys Gin 

65 70 75 80 



45 



His Cys Arg Leu Gin Lys Cys Leu Ala Val Gly Met Ser Arg Asp Ala 
85 90 95 

Val Lys Phe Gly. Arg Met Ser Lys Lys Gin Arg Asp Ser Leu Tyr Ala 
100 105 HO 



Glu Val Gin Lys His Arg Met Gin Gin .Gin Gin Arg Asp His Gin Gin 
50 115 120 125 

Gin Pro Gly Glu Ala Glu Pro Leu Thr Pro Thr Tyr Asn He Ser Ala 
130 135 140 

55 Asn Gly Leu Thr Glu Leu His Asp Asp Leu Ser Asn Tyr He Asp Gly 
145 150 155 160 

His Thr Pro Glu Gly Ser Lys Ala Asp .Ser Ala Val Ser Ser Phe Tyr 
165 170 175 

60 

Leu Asp He Gin Pro Ser Pro Asp Gin Ser Gly Leu Asp He Asn Gly 
180 185 190 

He Lys Pro Glu Pro He Cys Asp Tyr Thr Pro Ala Ser Gly Phe Phe 
65 195 200 205 



Pro Tyr Cys Ser Phe Thr Asn Gly Glu Thr Ser Pro Thr Val Ser Met 
210 215 220 * 
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Ala Glu Leu Glu His Leu Ala Gin Asn He Ser Lys Ser His Leu Glu 

225 230 235 240 

5 Thr Cys Gin Tyr Leu Arg Glu Glu Leu Gin Gin He Thr Trp Gin Thr 

245 250 255 



10 



Phe Leu Gin Glu Glu He Glu Asn Tyr Gin Asn Lys Gin Arg Glu Val 
260 265 270 

Met Trp Gin Leu Cys Ala He Lys He Thr Glu Ala He Gin Tyr Val 
275 280 285 



Val Glu Phe Ala Lys Arg He Asp Gly Phe Met Glu Leu Cys Gin Asn 
15 290 295 300 

Asp Gin He Val Leu Leu Lys Ala Gly Ser Leu Glu Val Val Phe He 
305 310 315 -320 

20 Arg Met Cys Arg Ala Phe Asp Ser Gin Asn Asn Thr Val Tyr Phe Asp 

325 330 335 



25 



Gly Lys Tyr Ala Ser Pro Asp Val Phe Lys Ser Leu Gly Cys Glu Asp 
340 345 350 

Phe He Ser Phe Val Phe Glu Phe Gly Lys Ser Leu Cys Ser Met His 
355 360 365 



Leu Thr Glu Asp Glu He Ala Leu Phe Ser Ala Phe Val Leu Met Ser 
30 370 . " 375 380 

Ala Asp Arg Ser Trp Leu Gin Glu Lys Val Lys He Glu Lys Leu Gin 
385 390 395 400 

35 Gin Lys He Gin Leu Ala Leu Gin His Val Leu Gin Lys Asn His Arg 

405 410 415 



40 



Glu Asp Gly He Leu Thr Lys Leu He Cys Lys Val Ser Thr Leu Arg 
420 425 430 

Ala Leu Cys Gly Arg His Thr Glu Lys Leu Met Ala Phe Lys Ala He 
435 440 445 



Tyr Pro Asp- He Val Arg Leu His Phe Pro Pro Leu Tyr Lys Glu Leu 
45 450 455 460 



50 



55 



Phe Thr Ser Glu Phe Glu Pro Ala Met Gin He Asp Gly 
465 470 475 



<210> 9 

<211> 3646 

<212> DNA ' 

<213> Homo sapiens 



• <400> 9 

ctgcctttgg agaaagtgga gtgtggcgct tggttgtcgt tatttcttcg gactgcttcg 60 

cggtgcacgg attcagcttc tgcccagtgg ggctttcagc tgtttgcgcg tctctctgtc 120 

cccctcccct ccccccggca cacctctgtc tacgatgagg aaaggtctgc gggcgacagc 180 

60 ggcccgctgc ggactgggac tgggatactt gctgcaaatg ctcgtgctac ctgccctggc 240 

cctgctcagc gccagcggca ctggctccgc cgcccaagat gatgactttt ttcatgaact 300 

cccagaaact tttccttctg atccacctga gcctctgcca catttcctta ttgagcctga 360 

agaagcttat attgtgaaga ataagcctgt gaacctgtac tgtaaagcaa gccctgccac 420 

ccagatctat ttcaagtgta atagtgaatg ggttcatcag aaggaccaca tagtagatga 480 

65 aagagtagat gaaacttccg gtctcattgt ccgggaagtg agcattgaga tttcgcgcca 54 0 

gcaagtggaa gaactctttg gacctgaaga ttactggtgc cagtgtgtgg cctggagctc 600 

cgcgggtacc acaaagagcc ggaaggcgta tgtgcgcatt gcatatctac ggaagacatt 660 

tgagcaggaa cccctaggaa aggaagtgtc tttggaacag gaagtcttac tccagtgtcg 720 
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accacctgaa 
tcccgttgaa 
ccgactctct 
aagtacaact 
5 gtctgtgtgt 
cccggcacca 
tactacgtta 
aactgagtgc 
caaggactgc 

10 gactgctcct 
ttgcctggcg 
tgagtcagat 
ggcagcaaga 
cagaggacct 

15 aattctggat 
cccccaagat 
gttggagaat 
ctgtaccgca 
agtcagcttg 

20 gactgtacac 
ccctgtggtg 
gcatcactgc 
acagggacag 
cattaagctg 

25 ggtaggacat 
cctgtgctgc 
tgccctgaag 
taaggctctt 
ccattccctc 

30 ttggagtgga 
cacagtggag 
ccagctcaac 
tgcgaacacc 
gcagaagctc 

35 ggcccataag 
tggcgtaatc 
ggcagctgtc 
gcagtattaa 
ggccgtccag 

40 gcaagaatgc 
ttactcggag 
ctafecaacct 
ttcctaggaa 
aatttctaca 

45 gttctatttt 
cagattagta 
) ctcagataat 
tgtatcaggg 
ggagggaccc 

50 



gggatcccag 
gatcggaatt 
gatactgcaa 
gccactgtca 
aacagccgct 
ctcaatgggg 
tgcccagtgg 
acccactggc 
gacggcctcg 
gattcagatg 
atctctgtag 
attattgact 
caagatctgc 
gtctatgccc 
ccactgccca 
gacctctctg 
gaagccctca 
tttggcasct 
ctgattcccg 
aggaaagaaa 
agctgtgggc 
gcagacccca 
tgggaggatg 
gatgcagagg 
tccaccacca 
tcctcgctgg 
gaaattttac 
cattttaaag 
tggaagagca 
tctcaaagaa 
ctggtttgca 
tgcaccgtgt 
atcaccacgg 
tgtagcagcc 
ctgaacctgg 
ctggatcttt 
ttggaagaaa 
ccaccatgct 
gtgaatcaca 
agcaggagcc 
atcatcacgg 
aattttcctc 
acgcttttta 
aatgttgttt 
tgttcaaatc 
atttacagag 
cctcccatca 
aagtggagaa 
ttctggtgcc 



tggctgaggt 
tttatattac 
attacacctg 
tagtctatgt 
gtggacgagg 
gtgccttctg 
atggcaggtg 
gcaggaggga 
tcttgcaatc 
atgttgctct 
ttgtggcctt 
cttcggcact 
tggctgtacc 
tgcatgacgt 
acctgaaaat 
agtttacgtc 
gcctgaagaa 
tcaactcgct 
ctggggccat 
ctatgaggcc 
ccccaggagc 
ataccgagga 
tggtggtggt 
cctgccacat 
aagcggctgc 
agtacagcat 
atcttgagag 
gcagcaccca 
aattgctggc 
acctgcactg 
aactctgtgt 
cagaggaacc 
tcacggggcc 
tggatgcccc 
acaggtactt 
gggaagcaca 
tgggaagaca 
ggaaggggaa 
gctgaggagg 
agaaggaaaa 
gagttaagaa 
ttagttgggc 
ttgctatcca 
acatcaaatt 
atttgaagtt 
aaatcacaga 
gtgttctgtt 
tctaagtgta 
ctttgggaat 



- 12- 

ggaatggttg 
tattgatcac 
tgttgccaaa 
caacggtggc 
gtatcagaaa 
tgaagggcag 
gacgccatgg 
gtgcacggcg 
caagaactgc 
ctatgttggg 
gtttgtgtat 
caatgggggc 
cccagacctc 
ctcagacaaa 
caaagtgtac 
caagctgtcc 
ccagagtcta 
gggaggtcac 
tccccaaggg 
acccatggat 
tctgctcacc 
ctggaaaata 
cggggaggaa 
cctcacagag 
aaagcg'cctc 
ccgagtctac 
acagacggga 
caacctgcgc 
taaatatcag 
caccttcact 



gcggcaggtg 
tactggcatc 
cagtgctttc 
ccagacgaga 
gaattacttt 
gaacttccca 
tgaaacggtg 
atgaaggaca 
aaatccagat 
cagatacaac 
aaattgtgta 
tgtatgctgt 
gatatatgga 
ggacagggat 
gaagctgtgg 
cttttgctac 
tctagaactt 
aaaaagaaat 
aaagctgtag 



aaaaatgaag 
aacctcatca 
aacattgttg 
tggtccacct 
cgtacaagga 
agtgtgcaga 
agcaagtggt 
ccagccccca 
actgatgggc 
attgtgatag 
cggaagaatc 
tttcarcctg 
acgtcagctg 
atcccaatga 
aacacctcag 
cctcagatga 
gcaaggcaga 
cttattgttc 
agagtctacg 
§actctcaga 
cgccccgtcg 
ctgctcaaga 
aacttcacca 
aacctcagca 
aagctggcca 
tgtctggatg 
ggacagctcc 
ctgtcaattc 
gaaattccat 
ctggaaagat 
gaaggagaag 
gatttgccgc 
agcatccctc 
ggccatgact 
gccaccaaat 
gatggaaacc 
gtgtccttag 
aaaatgcaca 
gagaccaatg 
tgcccatgta 
aatttgtacc 
gtggtacagg 
taaactttct 



gcagacactg 
acggtttgtt 
aaatcgtgtg 
gtagaaccag 
aactaagact 
cactgc 



acataattga 
taaagcaggc 
ccaagaggaa 
ggacggagtg 
cttgtaccaa 
aaatagcctg 
ctacttgtgg 
agaatggagg 
tttgcatgca 
cagtgatcgt 
atcgtgactt 
tgaacatcaa 
cagccatgta 
ccaactctcc 
gtgctgtctc 
cccagtcgtt 
ctgatccatc 
ccaattcagg 
aaatgtatgt 
cacttttgac 
tcctcactat 
accaggcagc 
ccccctgcta 
cctacgccct 
tctttgggcc 
acacccagga 
tagaagaacc 
acgatatcgc 
tttaccatgt 
ttagcctgaa 
ggcagatctt 
tgctggatcc 
tccctatccg 
ggaggatgct 
ccagcccaac 
tgagcatgct 
cagcagaagg 
gggagtctgt 
cacttcacag 
catgcccact 
ttgaatttag 
atcttacagt 
taacaaaccc 
tccatggctc 
gtgtctattt 
catcaagtgt 
tgttactgtt 
cctattcctt 



780 
840 
900 
960 
1020 
1080 ' 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3646 



<210> 10 
<211> 931 
<212> PRT 
55 <213> Homo sapiens 



60 



<400> 10 

Met Arg Lys Gly Lieu Arg Ala Thr Ala Ala Arg Cys Gly Leu Gly Leu 
!* , 5 10 15 

Gly Tyr Leu Leu Gin Met Leu Val Leu Pro Ala Leu Ala Leu Leu Ser 
20 25 30 



Ala Ser Gly Thr Gly Ser Ala Ala Gin Asp Asp Asp Phe Phe His Glu 
65 35 40 45 



Leu Pro Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe 
50 55 60 
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Leu He Glu Pro Glu Glu Ala Tyr He Val Lys Asn Lys Pro Val Asn 
65 70 75 - 80 

5 Leu Tyr Cys Lys Ala Ser Pro Ala Thr Gin He Tyr Phe Lys Cys Asn 

85 90 95 

Ser Glu Trp Val His Gin Lys Asp His He Val Asp Glu Arg Val Asp 
100 105 H° 

Glu Thr Ser Gly Leu He Val Arg Glu Val Ser He .Glu He Ser Arg 
115 . 120. ' 125 

Gin Gin Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys 
15 130 135 140 

Val Ala Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val 
145 ~ 150 155 • * 160 

20 Arg He Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys 

165 170 175 

Glu Val Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu 
180 185 ' ' 190 

25 

Gly He Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp He He 
195 200 205 

Asp Pro Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu 
30 210 215 220 

He He Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val 
225 230 • 235 240 

35 Ala Lys Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He 

245 250 255 

Val Tyr Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys 
260 265 270 

40 

Asn Ser Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr 
275 280 • 285 

Asn Pro Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val 
45 290 295 300 

Gin Lys He Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr 
305 310 315 320 

50 Pro Trp Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg 

325 330 335 

Arg Arg Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys 
340 . 345 350 

55 Asp Gly Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met 
355 360 365 

Gin Thr Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He Val 
60 370 375 380 

He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu Phe 
385 390 395 400 

65 Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp Ser 

405 410 415 

Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala Ar$ 
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420 425 430 

Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala Met 
435 440 445 

Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys lie Pro 
450 . 455 460 

Met Thr Asn Ser Pro lie Leu Asp Pro Leu Pro Asn Leu Lys lie Lys 
10 465 470 475 480 

Val Tyr Asn Thr Ser Gly Ala Val Ser Pro Gin Asp Asp Leu Ser Glu 
485 490 495 

15 Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn 
500 505 510 

Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro 
515 " 520 525 

20 \ . 

Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu He 
530 "* 535 540 

Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala He Pro 
25 545 550 555 560 

Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu Thr 
565 570 575 

30 Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val 
580 585 590 

Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr 
595 600 -605 

35 

Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu 
610 615 620 

Lys Asn Gin Ala Ala Gin Gly Gin Trp Glu Asp Val Val Val Val Gly 
40 625 630 635 640 

Glu Glu Asn Phe Thr Thr Pro Cys Tyr He Lys Leu Asp Ala Glu Ala 
645 650 655 

45 Cys His He Leu Thr Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly His 
660- 665 670 

Ser Thr Thr Lys Ala Ala Ala Lys Arg Leu Lys Leu Ala He Phe Gly 
675 680 685 

50 

Pro Leu Cys Cys Ser Ser Leu Glu Tyr Ser He Arg Val Tyr Cys Leu 
690 695 700 

Asp Asp Thr Gin Asp Ala Leu Lys Glu He Leu His Leu Glu Arg Gin 
55 705 710 715 720 

* Thr Gly Gly Gin Leu Leu Glu Glu Pro Lys Ala Leu His Phe Lys Gly 
725 730 735 

60 Ser Thr His Asn Leu Arg Leu Ser He His Asp He Ala His Ser Leu 
740 745 . 750 

Trp Lys Ser Lys Leu Leu Ala Lys Tyr Gin Glu He Pro Phe Tyr His 
755 760 765 

^ Val Trp Ser Gly Ser Gin Arg Asn Leu His Cys Thr Phe Thr Leu Glu 
770 "* 775 780 
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Arg Phe Ser Leu Asn Thr Val Glu Leu Val Cys Lys Leu Cys Val Arg 
785 790 795 800 

Gin Val Glu Gly Glu Gly Gin lie Phe Gin Leu Asn Cys Thr Val Ser 
5 805 810 815 

Glu Glu Pro Thr Gly He Asp Leu Pro Leu Leu Asp Pro Ala Asn Thr 
820 825 030 

10 He Thr Thr Val Thr Gly Pro Ser Ala Phe Ser He Pro Leu Pro He 
835 840 845 

Arg, Gin Lys Leu Cys Ser -Ser Leu Asp Ala Pro Gin Thr Arg Gly His 
850 855 860 

^ Asp Trp Arg Met Leu Ala His Lys Leu Asn Leu Asp Arg Tyr Leu Asn 
865 870 875 880 

Tyr Phe Ala Thr Lys Ser Ser Pro Thr Gly Val He Leu Asp Leu Trp 
20 885 890 895 

Glu Ala Gin Asn Phe Pro Asp Gly Asn Leu Ser Met Leu Ala Ala Val 
900 -90S 910 

25 Leu Glu Glu Met Gly Arg His Glu Thr Val Val Ser Leu Ala Ala Glu 
915 920 925 



Gly Gin Tyr 
930 

30 



<210> 11 
<211> 6277 
<212> DNA 
35 <213> Homo sapiens 

<400> 11 

ataaagcaca tttagcattg • ttgtgtttta 
agcatataat taagattttt tgaataaact 

40 tatcattacc ctaatgccca tttttggttg 
caaggatgtc catggttttg gaatttaaga 
aattaaagta gcagatggat ttaaagttgc 
tcttggatta ttccattggg tccaatatta 
agaagaagtg agtgctttga tacgagaagt 

45 gggggaaggg gccactagcc aaagaatgta. 
aaacatgctt tctgtagagg ctgcagaaag 
ttaaggctgc cctctggtcc tagccttggc 
acaatttatt ttttaaaaag tcatcaagag 
. tcttatgtaa tctaatcaag gaaatggaat 

50 ctcaagagaa aaacatatac atggcaaata 
ttggaattct gtctacaaca ctcctggtat 
cacatatata tacgtacata cacatccatg 
gtagacatat cagcttctta taatatttga 
tttcgctgtt ttagctacca gatcccatct 

55 gaaagtttct ctatgcatct tttctctcgt 
aacagcaaaa ttattgactg acacttcaaa 
aatatttttc tttatgaggg tgaagtagaa 
gaaaaaactg taacaaaaaa atcttcacac 
agtgattaat taaattccaa tgctcctctg 

60 aatatcgccg ggcatggtgg ctcacgtctg 
gcagatcacg aggtcaggag atcgagacca 
actaaaaata caaaaaaaaa aaaagttagg 
gctactcggg aggctgaggc agaagaatgg 
gccaagatcg tgccactgca ctccagcctg 

65 aaaaaaaaaa ttataactaa agttttgaat 
cccttaaata tatgtaataa aaaattaaag 
ccctgaaaga taacaaatga tattataaac 
tacataattt tatgagcaaa atggtgtgga 



ataactgcaa atgttctaaa tgtgaataag 60 
cctaaaattg aaatgtttgt gttgtcttta 120 
aggtatttga cagccagaat aatggctccc 180 
atatgttaca ttacatggca aatctgtaga 240 
taatctgctg actttaagag aggtagattt 3 00 
tcacaataat ttctaaatgt ggaggaaatt 360 
acttgaacca tacttgctgg gtgtgaagat 420 
ggtgtcctct ggaagctgga aacagctaga 480 
gagctcagct ctgggatcat tctcagttcc 540 
tagcctaggc cttggccttc tcatagaatc 600 
aattaatctc tcaagcttgc catgatcatg 660 
atcccctcat atccataagc cctggccaca 720 
tagcagagaa caagaaattt ggaggccatc 780 
tccctctgtg tgtttgtctt tgtctctctc 840 . 
cacatatgta tttgtattac ttttttcaat 900 
tttctttgtt tttccactag ttcagtcata 960 
gatacatgaa aacaacatga agaagataaa 102 0 
gacattcaat tgcatactca ccatcaccca 1080 
taatctcttc taatacttat ttacattaaa 1140 
attaattcac agctctgtac tctggacaaa 1200 
tgatctaaca ctttatattc tcatgagtca 1260 
aaggaggttg ggcatttcta taaattaaaa 1320 
taatcccagc actttgggaa gccaaggcag 1380 
tcctggctta cacggtgaaa ccctgtctct 1440 
tgggcgtagt ggcgggcgcc tgtagtccca 1500 
cgtgaaccca ggaggcggag cttgcagtga 1560 
ggtgacagag cgagactcca tctcaaaaaa 1620 
taaatcattc tgattgtcca ggaatgcaca 1680 
atttcttata gaaaaactta ttgaaaaagt 1740 
tatgtatttc tgtagtaaat ttcccaaaaa 1800 
atatataaaa tattgtatgt atgtgtgtgt 1860 
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atataagctt tttaagtgaa gttaagaaac atgaatgagg acctatgaag cattatttct 1920 
gtgcacgtac caatatcctg ttatttccag tgcccaaccc taagatctct gactctacca 1980 
cattaacttc agactaaaaa ctaacccttt taaacacacc gtaaatgatt tttttttgtt 2040 
aaatgcaact taaaaatttc aaaatatata atagtaaaaa tgatgtttaa tatattgtgt 2100 
5 tgaaatgtta aagaataaac tgtagtttga atgcatctga atattttgta ggcatgcaac 2160 
agctcttagt aagaaaccat tccatctttc tgagagaaag ttgtatatta actgaagaga 2220 
ggcatcttgt tttgatctgg aggcacaaga aatcttaaag tgtgaatccc attaggttat 2280 
aagataaaaa aaaatcctat cagagaatag gaaaataaaa accactagta atctcattac 2340 
tgcctcttca aatactactg ctatataggt gaacagcttc ccctctagtc tttcctcttt 2400 
10 cttcccaaaa tagccgacga gtggatcaag catgtctagt gggatataag cacaaagtat 2460 
gcttattctt gttttctcct ctttttattt gttctgatta ttaatttttc ccctagtcat 2520 
agttcttttt ttggatgaaa tttagggcaa gtgttcaggc ttacctagtt acttcagatg 2580 
atgacatctc ctctaaacag taattaatat tttcagtcca tggtaatatt caacatttta 2640 
agctcattta aaatactaac tatgccccct gatatggttt gactctgtgt ctacacccaa 2700 
15 atctcatgtc aaattgtaat ccccacttat tgaaggtggg gactggtggg aggtgattgg 2760 
atcatagtcg ggggttccag tggtttagcg ccatcctccc tagggctgtc ttgtgataga 2820 
gttctcacca gatccgatga gatctgatgg tttcaaagtg tgtagcagtc tcagtctctc 2880 
tgtctgtctg tctgtctctc tctctgtctc tcaatcccct gctccaccat ggtaagatgt 2940 
gcttacttcc cctttatgtt ttgacacgat tgtaagtttc ctgaggcctc ccagccatgc 3000 
20 ttctgtacag cctgaggaac tgtgagtcaa ttaaatgtct tttcttcata aactacccag 3060 
tctcaggtag ttctttatag cagcatgaaa acagactaat acaccccctt tctcagcctg 3120 
tgcttgtggt aaggctagcc aatgcaagca gaaagagggg tgctgtctgc tgtttctcca 3180 
attggcatct gtgccccttc ttcatcttgt cagttatagt gggaggccca agacttactt 3240 
ttggcctcag acacagtcaa ggatcatcct gatgcaaaaa acaaaatcac ctttgaacct 3300 
25 cttcctccac accatcttcc actttggttt tccgctatgc ctttgtactg gggacaaaga 3360 
ctatccacga atccttactt taaaaactga ggcagtggct ctgaatcatt cccccttgga 3420 
cacccccaca catctgaaca ccctcgtgtc ccaacaccac atctgctgaa gttgactgga 3480 
cctgaaatgc tggaggctca gtgttttcgg aaactgcgga gagttctcct gccacaattc 3540 
tcttgttgtc cgtcaaaagc aattacatgt ttcagatgat gaaaagcact gtcatgaaaa 3 600 
30 gctggcacag cagccaagga gacagatgta ttaaaccaca cggcttctca gaactcccac 3660 
ttacgttgga gctcagcaca gagatgagca tggtgatatc cccccgtcta acatcctccc 3720 
atttggattt aatgcatgag gctcgtttct gatgaggtgc caaattcttt cggtggacaa 3780 
acacaaagtt gcagaattca gtgatcatac aagtccatgt cctcaaacga aaataataat 3840 
actataaatg caaagatgct accataaaaa atacagtagg tggacttttg tgagtcaaca 3 900 
35 taaaaaaatc cagcatttct aactggtagc atgaggagta ccagtctgca ggatgttaac 3 960 
gcactgatag gctatggaca atatattcca cacttacgga agtggggaaa ctataggtca 4020 
gacaaagcta caggtacttc tttctttctt ttttcttttt ttctttctga gatggagtct 4080 
ccctctgtcg cctaggctgg ' agtgcaatgg catgatctca gctcactgca acctccgcct 4140 
cccgggttca agtgattctc ctgcctcagc ctcctgagtc gctgggatta taggcgcaca 4200 
40 ccaccaacca cccggctaat ttttcatatt tttagtagaa acgaggtttc actgtgttag 4260 
ccagaatggt ctcgaactcc tgacqtaggt gatctgcctg cctcagccte ccaaagtgct 4320 
gggattacag gtgtgagcca cggcactcgg cccaagtact tcttaactgc aggacttctc 43 80 
agaggctttt aatatggtaa tgagtgctgt gcagtgagat gactcaaatc aatatatagc 4440 
caacacgttt gatatgacca cagaacccat ttccctgaac actttttatc acctcctaga 4500 
45 gcaggggtca gaaaactact gcccatgtgg tctgatctgg ccactgcttg ttttcgtaaa 4560 
ttttcactgc ccacagccaa cctcatttgt gtacatgccg tccatggctg ctttcacact 4620 
gcagccgcag cgatgagtca tgagacacgc acccgtggct cacaaagtct aaaatattta 4680 
ctctttggcc ctttacggaa aacgtgtaca aactttgtcc tctgaaatat agtttgggaa 4740 
cctacttcac tcatgatatc aataattcta aaaaatttca aacaattttt aaagtcaaca 4 800 
50 ttgagaagtt ttccatggtt agaatttaat aattacagtc aataactctc ttaaagcatc 4860 
gtgacaataa agtagaagga cactgtgtaa ttctatatgg tctctgtctt cagggagctc 4920 
ccaagctagt ttgtgaaaaa gagaactcaa ggcacaaaca actggggggc aagagatgtt 4980 
ctaaacacta agaggaggca atatggcata gtagaaagaa tgcaagcatg tgtaagagta 5040 
aaacatacac agggcgcaga gtcacaagag tcaggggttg gtcttggctg tgataccttc 5100 
55 tcctgcgata agagtaagac ctgctttatt gagcacttac tatgtgccag gcaccatccc 5160 
agttgctttc catgaaggat -tgctctttaa ttcttatcat aatgaatagg tactattatt. 5220 
atcccagttt atgggtgagg aaaccaagtc ttaggttgac taacttatta aaataaagca 5280 
catacctagt aggtggtgaa caagagccaa aacaagtctg ttggtcagct aaccaccacg 5340 
ttatagtggc tgctgcttag gggcagcata tatcctctgc atttccccag aggtgacaga 5400 
60 agtagtggag agggtagtgc tctctaggct ctgacacaca gcccatgttt gctgactcaa 5460 
caaccacaca acgtaatgct atggcacctc aggtggcagg gagggtatcg tgggctggac 5520 
tgatcttcag aaggctgcat ctgagcaggg tgtggaagga tggcagaagg aagaggggag 5580 
tgaggcactc agggtagaca gaaggtgata ctgccacatc ctccactcag aaggaatctg 5640 
gaaggttttg acaaatatgt acattttgga gtgggaacag cagaaagtag gggctggcta 5700 
65 cctactttcc actaccccta tccccactcc ccctaggggg tgaggcagag ataagcacgc 5760 
tttgtggggc acactgatcc acaaacttgc aagcctgagc tccactcagc agcagcctgg 5820 
gcaggatgaa tggtggtgtg caggtgtttg cggcatgcat agcttctctt tctcctagag 5880 
tcaatccagt gatcccctgc gaaaagaatc ctccacttga cctgaatgct ctgtattggc 5940 
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accttctgga caggaatcct atcgggtatt 
gtggctccat gagagtaaga caccaggaca 
tgtctgcatc ccagagagtg ggagaggctt 
caacacttct acatgcaaag gaatctgaag 
5 ggcctca.ctg accattcccg tgagcctcag 
gcagtccaac tctgcagcac gaattaccaa 



tcatcaaagc atcccacagc agttttaagt 6000 
gctgtgccac ctagtgatgc ttgcctgtac 6060 
cggaaaggac tggtatttgc tatttttggc 6120 
tctccgcctg ggcttcaaaa catttcttgt 6180 
gggacatgag tgtcttcaag ctaagaggga 6240 
ggaacat 6277 



<210> 12 
10 <211> 1829 
<212> DNA 
<213> Homo sapiens 

<400> 12 t t - A 

15 ttttgccttg gttttcccag ctattaaaag ggagatatca tcattcattc aacaagcatt 60 
tcttgcggac ccattttcca cgacagaacc atccaaacca agacacttct tcctattgtg 120 
gcaaaaatga gcaacgttaa aactccccag aatgcagagg tgactcacgt gtttatgtaa 180 
gtaagcattt agatatggct ttggaaaaga aaacacacac ctaaaagagg ctaccagcat 240 
ttgaataaag tgtgtaacct gggtgttcaa aactgtacaa cagtttctgt gtactttaag 300 
20 tcactatcct tcatacttct tccctttaaa ttctatttga aactaaataa tatcaacaat 360 
actttaagta agattatgct ggtgttactg atgaaattat atgcaggtct gcttggcctg 420 
ttctcatgcg taagcctttg ataacttgtt cctaggttat ccagttctgc ttacctgagt 480 
• acccacttta tcccagaaac gagacctgaa cactgtgtcc tggtttcatc tttccagtga 540 
gccagccttc tgaggcatta gcggctcagc aagaccatgg aggtgagaga ggcacaggca 600 
25 cactccagat gcagggtctc tgccctctgt caagagtcac acggcagaga ggacaagact 660 
cctgtcccgg gggaactgtt tttccttttg ccccacaccc cagatatctt cgtgacagcc 720 
cctgcectga aagcccggtg agcgtcatgg cggagaagcc agcctgthtc cgattattct 780 
gcctgacagg gactatctgc tgctgtgtgc agttgtagaa atgtcataag ctcaggactc 840 
aggacacttt gacgtcgagt ccaacaggac ccatagagtc cccctgaaca attctcatct 900 
30 gtgtgggaag cctaaggggt ggfctttaagt aggggctggc aggagagtga ggtctgggct 960 
gcttctttac ggctcttcaa gtgccggcca tttcctggga ttcccaggag taagtcttga 1020 
ctcccatgaa tcccactgac ccctaagtgg catgggaaag gagaggacaa aaagaaagga 1080 
gttagacatt tgactaatac tgttcagagt tgaaatgcgc tctttgagca ggtaggcatt 1140 
taaagctaac ttttcctcat gggatacact tgtgggttct ttggagactg atctgcgtga 1200 
35 ccccaatggc tcaacgagaa tcttctccta caattccctt gactaggtaa tgccctcact 1260 
ctcctgctat cttcttctcc ctggttctca gtgtttcact gccacataaa ctctctctct 1320 
cactgttggg gtcccattgc ccctctgagt ggaattcctg ggcaaaagat tcaatgcaag 1380 
ctcaccaggc tatctcacat tgggcctaca tctggtctcc tggaaatatg cccatatcct 1440 
ttgtctggac aatcctgaga cagacccagc ccaatgcaga agtgcctatt cttctctcag 1500 
40 cagcagatta acttgtctgg gagttactca tcctcagact tctcaagctc attccgacac 1560 
ctgtccacct aatgcagcaa ctgcacgtga cacaataaac acttaa.tgca agtcacaaat 1620 
gggagtcaca tatataatat tattagtagc cacgctagaa aaatgagaca gaaacagata 1680 
aagttaattt caatttcagt aacatttcat ttaatctaat ctgtcaaaaa tactattgtc 1740 
tcaatatata atcaacaaaa ctgttaatga aatacattac agtttttttg tattaagtct 1800 
45 ttgaaatgga gtgtgtattt tagacttaa 18 



<210> 13 
<211> 2612 
50 <212> DNA 

<213> Homo sapiens 

<400> 13 

atgaggaaag gtctgcgggc 
55 caaatgctcg tgctacctgc 
caagatgatg acttttttca 
ctgccacatt tccttattga 
ctgtactgta aagcaagccc 
catcagaagg accacatagt 
60 gaagtgagca ttgagatttc 
tggtgccagt gtgtggcctg 
cgcattgcat atctacggaa 
gaacaggaag tcttactcca 
tggttgaaaa atgaagacat 
65 gatcacaacc tcatcataaa 
gccaaaaaca ttgttgccaa 
99tggctggt ccacctggac 
cagaaacgta caaggacttg 



gacagcggcc cgctgcggac 
cctggccctg ctcagcgcca 
tgaactccca gaaacttttc 
gcctgaagaa gcttatattg 
tgccacccag atctatttca 
agatgaaaga gtagatgaaa 
gcgccagcaa gtggaagaac 
gagctccgcg ggtaccacaa 
gacatttgag caggaacccc 
gtgtcgacca cctgaaggga 
aattgatccc gttgaagatc 
gcaggcccga ctctctgata 
gaggaaaagt acaactgcca 
ggagtggtct gtgtgtaaca 
taccaacccg gcaccactca 



tgggactggg atacttgctg 60 
gcggcactgg ctccgccgcc 120 
cttctgatcc acctgagcct 180 
tgaagaataa gcctgtgaac 240 
agtgtaatag tgaatgggtt 3 00 
cttccggtct cattgtccgg 360 
tctttggacc tgaagattac 420 
agagccggaa ggcgtatgtg 480 
taggaaagga agtgtctttg 540 
tcccagtggc tgaggtggaa 600 
ggaattttta tattactatt 660 
ctgcaaatta cacctgtgtt 720 
ctgtcatagt ctatgtcaac 7 80 
gccgctgtgg acgagggtat 840 
atgggggtgc cttctgtgaa 900 
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gggcagagtg 
ccatggagca 
acggcgccag 
aactgcactg 
5 gttgggattg 
gtgtatcgga 

gggggctttc 

gacctcacgt 
gacaaaatcc 

10 gtgtacaaca 
ctgtcccctc 
agtctagcaa 
ggtcacctta 
caagggagag 

15 atggatgact 
ctcacccgcc 
• aaaatactgc 
tgccctgaag 
taaggctctt 

20 ccattccctc 
ttggagtgga 
cacagtggag 
ccagctcaac 
tgcgaacacc 

25 gcagaagctc 
ggcccataag 
tggcgtaatc 
ggcagctgtc 
gcagtattaa 

30 



tgcagaaaat 

agtggtctac 

cccccaagaa 

atgggctttg 

tgatagcagt 

agaatcatcg 

agcctgtgaa 

cagctgcagc 

caatgaccaa 

cctcaggtgc 

agatgaccca 

ggcagactga 

ttgttcccaa 

tctacgaaat 

ctcagacact 

ccgtcgtcct 

tcctcgctgg 

gaaattttac 

cattttaaag 

tggaagagca 

tctcaaagaa 

ctggtttgca 

tgcaccgtgt 

atcaccacgg 

tgtagcagcc 

ctgaacctgg 

ctggatcttt 

ttggaagaaa 

ccaccatgct 



agcctgtact 

ttgtggaact 

tggaggcaag 

catgcagact 

gatcgtttgc 

tgactttgag 

catcaaggca 

catgtacaga 

ctctccaatt 

tgtcaccccc 

gtcgttgttg 

tccatcctgt 

ttcaggagtc 

gtatgtgact 

tttgacccct 

cactatgcat 

agtacagcat 

atcttgagag 

gcagcaccca 

aattgctggc 

acctgcactg 

aactctgtgt 

cagaggaacc 

tcacggggcc 

tggatgcccc 

acaggtactt 

gggaagcaca 

tgggaagaca 

ggaaggggaa 



acgttatgcc 

gagtgcaccc 

gactgcgacg 

gctcctgatt 

ctggcgatct 

tcagatatta 

gcaagacaag 

ggacctgtct 

ctggatccac 

caagatgacc 

gagaatgaag 

accgcatttg 

agcttgctga 

gtacacagga 

gtggtgagct 

cactgcgcag 

ccgagtctac 

acagacggga 

caacctgcgc 

taaatatcag 

'caccttcact 

gcggcaggtg 

tactggcatc 

cagtgctttc 

ccagacgaga 

gaattacttt 

gaacttccca 

tgaaacggtg 

at 



cagtggatgg 

actggcgcag 

gcctcgtctt 

cagatgatgt 

ctgtagttgt 

ttgactcttc 

atctgctggc 

atgccctgca 

tgcccaacct 

tctctgagtt 

ccctcagcct 

gcagcttcaa 

ttcccgctgg 

aagaaactat 

gtgggccccc 

acoccaatac 

tgtctggatg 

ggacagctcc 

ctgtcaattc 

gaaattccat 

ctggaaagat 

gaaggagaag 

gatttgccgc 

agcatccctc 

ggccatgact 

gccaccaaat 

gatggaaacc 

gtgtccttag 



caggtggacg 

gagggagtgc 

gcaatccaag 

tgctctctat 

ggccttgttt 

ggcactcaat 

tgtaccccca 

tgacgtctca 

gaaaatcaaa 

tacgtccaag 

gaagaaccag 

ctcactggga 

ggccattccc 

gaggccaccc 

aggagctctg 

cgaggactgg 

acacccagga 

tagaagaacc 

acgatatcgc 

tttaccatgt 

ttagcctgaa 

ggcagatctt 

tgctggatcc 

tccctatccg 

ggaggatgct 

ccagcccaac 

tgagcatgct 

cagcagaagg 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 • 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2612 



<210> 14 
<211> 636 
<212> PRT 
35 <213> Homo sapiens 



40 



<400> 14 

Met Arg Lys Gly Leu Arg Ala Thr Ala Ala Arg Cys Gly Leu Gly Leu 
1 5 10 15 

Gly Tyr Leu. Leu Gin Met Leu Val Leu Pro Ala Leu Ala Leu Leu Ser 
20 25 30 



Ala Ser Gly Thr Gly Ser Ala Ala Gin Asp Asp Asp Phe Phe His Glu 
45 35 40 . 45 

Leu Pro Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe 
50 55 60 

50 Leu He Glu Pro Glu Glu Ala Tyr He Val Lys Asn Lys Pro Val Asn 
65 70 75 80 



55 



Leu Tyr Cys Lys Ala Ser Pro Ala Thr Gin He Tyr Phe Lys Cys Asn 
85 90 95 

Ser Glu Trp Val His Gin Lys Asp His He Val Asp Glu Arg Val Asp 
100 105 HO 



Glu Thr Ser Gly Leu He Val Arg Glu Vai Ser He Glu He Ser Arg 
60 - 115 120 125 

Gin Gin Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys 
130 135 140. 

65 Val Ala Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val 

145 150 155 160 



Arg He Ala Tyr Leu Arg Lys Thr Phe Glu- Gin -Glu Pro Leu Gly Lys 
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165 170 175 

Glu Val Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu 
180 185 190 

^ Gly He Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp He He 
195 200 205 

Asp Pro Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu 
10 210 215 220 

He He Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val 
225 230 235 240 

15 Ala Lys Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He 
245 250 255 

'» 

Val Tyr Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys 
260 265 270 

20 

Asn Ser Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr 
275 280 285 

Asn Pro Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val 
25 290 295 300 

Gin Lys He Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr 
305 310 315 320 

30 Pro Trp Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg 

325 330 335 

Arg Arg Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys 
340 345 350 

35 

Asp Gly Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met 
355 360 365 

Gin Thr Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He Val 
40 370 375 380 

He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu Phe 
385 390 395 400 

45 Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp Ser 

405 410 415 

Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala Arg 
420 425 430 

50 

Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala Met 
435 440 445 

Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He Pro 
55 450 455 460 

Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys He Lys 
465 470 475 480 

60 Val Tyr Asn Thr Ser Gly Ala Val Thr Pro .Gin Asp Asp Leu Ser Glu 
- 485 490 495 

Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn 
500 505 510 

65 

Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro 
515 520 525 
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Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu lie 
530 535 540 

Val Pro Asn Ser Gly Val Ser Leu Leu lie Pro Ala Gly Ala lie Pro 
5 545 550 555 560 

Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu Thr 
565 570 . 575 

10 Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val 
580 585 590 

Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr 
595 , 600 605 

* 5 Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu 
610 615 ' 620 

Leu Ala Gly Val Gin His Pro Ser Leu Leu Ser Gly 
20 625 630 • 635 



<210> 15 

<211> 28 

25 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
30 MF402L11I 

<400> 15 

atggagactt cgctcactgg tcctcttt 

35 

<210> 16 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

40 

<220> ^ . 

<223> Description of Artificial Sequence : Reverse Primer 

MF402L11I 

45 <400> "16 

* cctggaggca cataaaccct agacatcc 



<210> 17 
50 <211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

55 <223> Description of Artificial Sequence: Forward Primer 
MF402L11II 

<400> 17 

acatcctgag gtaggtggtc c 

60 

<210> 18 ■ 
<211> 21 
<212> DNA 
65 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence -.Reverse 
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MF402L11II 



<400> 18 

gggtctcact cattctgttg c 



<210> 19 
<211> 21 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF402IillIII 

<400> 19 

ctgtattttg actcaattgt g 



15 



20 <210> 20 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
25 <220> 

<223> Description of Artificial Sequence: Reverse 
Primer MF402L11III 

<400> 20 
30 ccaaaagcct ctccttatta c 



<210> 21 • 
<211> 28 
35 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
40 MF402L11IV 



<400> 21 

gaatacagac acacagagac agcacctt 

45 

<210> 22 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

50 

<220> ^ . 

<223> Description of Artificial Sequence: Reverse Primer 

MF402L11IV 

55 <400> 22 

gttccttgct ccagtgaaag acataatc 



<210> 23 
60 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

65 <223> Description of Artificial Sequence: Forward Primer 
MF402L11V 



<400> 23 



100852 

ctaacaccag ttcagtggaa cccattgc 



<210> 24 
5 <211>- 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

10 <223> Description of Artificial Sequence: Reverse Primer 
MF402L11V 



15 



<400> 24 

atttgatgtt catacagggt tgggggac 



<210> 25 
<211> 28 
<212> DNA 
20 <213> Artificial Sequence 



25 



<220> . 
<223> Description of Artificial Sequence: Forward Primer 
MF4 02Ij11VI 

<400> 25 

ctccacttgg actctgatga caccactg 



30 <210> 26 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



35 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF4 02L11VI 

<400> 26 

40 catagttcac tgcagcctca acctcct 



<210> 27 
<211> 28 
45 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
50 MF90A19I 



<400> 27 

taattatgat tagacagacc gtcctcca 

55 ■ 

<210> 28 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

60 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19I 



65 <400> 28 

agtttactac tcagcctttg ccttcact 
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<210> 29 
<211> 28. 
<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19II 

10 <400> 29 

tctgccccac gtccaatcta ggtaactt 



<210> 30 . 
15 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> Description of Artificial Sequence: Reverse Primer 
MF90A19II 

<400> 30 

gatggtgtgg aggaagaggt tcaaaggt 

25 

<210> 31 
<211> 28 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19III 

35 

<400> 31 

aactgagaac tggaaggtag ctttggga 



40 <210> 32 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
45 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19III 

<400> 32 

50 acaagagcag tcagaactat ttggccct 



<210> 33 
<211>' 28 
55 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
60 MF90A19IV 

<400> 33 

atttgctggc tccgacagta agtattcc 

65 

<210> 34 
<211> 28 
<212> DNA 



r 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Reverse Primer 
5 MF90A19IV 



10 



15 



<400> .34 ' 

atgcagtgat gcttaacacc ctgtctct 



<210> 35 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19V 

20 <400> 35 

cctgctctgt ctgccttgga aaagaaat 



<210> 36 

25 <211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

30 <223> Description of Artificial Sequence: Reverse Primer 
MF90A19V 



35 



45 



<400> 36 

tcttggccca gacggtataa gagtccat 



<210> 37 
<211> 28 
<212> DNA 
40 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19VI 

<400> 37 

agtgcaccag taagcatgcc ttccaata 



50 <210> 38 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
55 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19VI 

<400> 38 

60 aagctggcaa ggagattctc aaaagtgg 



<210> 39 " 
<211> 28 
65 <212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: Forward Primer 
MF90A19VII 

<400> 39 

accacatctg ctgaagttga ctggacct 



<210> 40 
<211> 28 
10 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
15 MF90A19VII 

<400> 40. 

tctgggcaac atagtgaaac cctgtctc 

20 

<210> 41 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19VIII 

30 <400> 41 

cttgaaccta ggagacggag gttgcagt 



<210> 42 
35 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

40 <223> Description of Artificial Sequence: Reverse Primer 
MF90A19VIII 

<400> 42 

. aagcatatct ctgaccctct ggccattc 

45 

• <210> 43 
<211> 28 
<212> DNA 
50 <213> Artificial Sequence 

<220> , 
<223> Description of Artificial Sequence: Forward Primer 
MF90A19IX 

55 

<400> 43 

gaagctgcaa tcccatcctt gtctcttt 



60 <210> 44 

<211> 28 

<212> DNA 

• <213> Artificial Sequence 

65 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19IX 



100852 



r 



- 26 



<400> 44 

tggtggtgca cacctgtaat accagcta 



5 <21G> 45 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



10 ' <220> ^ ^ 

<223> Description of Artificial Sequence: Forward Primer 

C4P3F 

<400> 45 
15 aggatgtcca tggttttgga 



<210> 46 
<211> 20 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
25 C4P3R 



<400> 46 

ctgtttccag cttccagagg 

30 

<210> 47 
<211> 20 
<212> DNA 

<213> Artificial- Sequence 

35 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
C4P4F 



40 <400> 47 

catggtccca aagct'ctgtt 



<210> 48 
45 <211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

50 <223> Description of Artificial . Sequence: Reverse Primer 
C4P4R 

<400> 48 

tgacggaatt tcagtgcaac 



<210> 49 
<211> 20 
<212> DNA 
60 <213> Artificial Sequence 



<220> _ _ » 

<223> Description of Artificial Sequence: Forward Primer 

C4P7F 

65 

<400> 49 

tctgatacag gggatttggc 
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<210> 50 
<211> 20 
<212> DNA 
5 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C4P7R 

10 

<400> 50 

ttcaggtggg tgatgtgtgt . 



15 <210> 51 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: Forward Primer 
C4P8F 

<400> 51 
25 aggaaagagg ggtgctgtct 



<210> 52 
<211> 20 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
35 C4P8R 

<400> 52 

atagcaggag agtgagggca 

40 

<210> 53 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

45 • 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
C4P9F 

50 <400> 53 

ggcccacttt ' ctggttcata 



<210> 54 
55 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

60 <223> Description of Artificial Sequence: Reverse Primer 
C4P9R 

<400> 54 

tcagccagac cagtgtgaaa 



<210> 55 
<211> 20 
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10 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward Primer 
C15P1F 

<400> 55 

ggtctctgcc ctctgtcaag 



<210> 56 
<211> 20 
<212> DNA 
15 <213> Artificial Sequence 



20 



55 



<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C15P1R 

<40O> 56 

aggcttccca cacagatgag 



25 <210> 57 ' • 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
30 <220> 

<223> Description of Artificial Sequence: Forward Primer 
C15P2F 

<400> 57 
35 cactcttggc acccttaacc 



<210> 58 

<211> 20 

40 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 

45 C15P2R 

<400> 58 

tttgttctgt cctccccaac 



50 



*210> 59 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward Primer 
C15P5F 

60 <4O0> 59 

tttcatcaaa gcatcccaca 



<210> 60 
65 <211> 20 
<212> DNA 

<213> Artificial Sequence 



100852 
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<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C15P5R 

5 <400> 60 

tggtaattcg tgctgcagag 



<210> 61 
10 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Forward Primer 
C15P6F 



20 



<400> 61 

cccaaacatt ccaacttcgt 



<210> 62 
<211> 20 
<212> DNA 
25 <213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C15P6R 

<400> 62 

tagctcaagc ctgaacccat 



35 <210> 63 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
40 <22Q> 

<223> Description of Artificial Sequence: Forward Primer 
BPC4F 

<400> 63 

45 accctaatgc ccatttttgg ttgagg 



<210> 64 
<211> 28 
50 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
55 BPC15R 

<400> 64 

aaataccagt cctttccgaa gcctctcc 

60 

<210> 65 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

65 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
BPC15F 



100852 



r 



10 



30 



<400> 65 

cgacagaacc atccaaacca -agacactt 



<210> 66 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Reverse Primer 
BPC4R 



15 <400> 66 

gtgtttattg tgtcacgtgc agttgctg 



•<210> 67 
20 <211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

25 <223> Description of Artificial Sequence: Forward Primer 
■RORal 5' TJTR 



30 



<400> 67 

ctgaaaacag aagatagagg gagtctc 



<210> 68 
<211> 20 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
UNCSc 3' UTR 



40 



<40'0> 68 

atttcccctt ccagcatggt 



45 <210>" 69 
<211> 27' 
<212> DNA 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence: Forward Primer- 
API 

<400> 69 

55 ccatcctaat acgactcact atagggc 



<210> 70 
<211> 22 
60 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse - Primer 
<400> 70 

ctccacagat cttgcatgga at 



65 
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<210> 71 
<211> 23 
<212> DNA 
5 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
AP2 ' 

10 

<400> 71 

actcactata gggctcgagc ggc 



15 <210> 72 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: Forward Primer 
<400> 72 

tctgtgtttc atacccagga ggt 

25 

<210> 73 
<211> 23 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 

35 <400> 73 „ . 

agaacacaca cccatagact gca 



<210> 74 
40 <211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Description of Artificial Sequence: Forward Primer 
<400> 74 

cctgtaccgc atttggcag 

50 

<210> 75 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

55 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 



<400> 75 
60 tgcaaaccag ctccactgtg 
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1 42 
AF055634 CTGGCTTTGGAGAAAGTGGAGTGTGGCGCTTGGTTGTCGTTAT 
UNC5C_exonl CTGCCTTTGGAGAAAGTGGAGTGTGGCGCTTGGTTGTCGTTAT 

43 102 
AF055634 TTCTTCGGACTGCTTCGCGGTGCACGGATTC^GCTTCTGCCCAGTGGGGCTTC 
UNC5C_exonl " TTCTTCGGACTGCTTCGCGGTGCACGGATTCAGCTTCTG 

SEQ ID NO: 1 CTGAAAACAGAAGATAGAGGGAGTCTCGGAGCTCGCCATCTCCAGCGATCTCTACATTG 
SEQ ID NO: 3 C T G AAAACAGAAGAT AGAGGGAGT CT CGGAGCT C G C CAT CT C CAG CGAT CT CT ACAT TG 

RORAl__exoHl C TGAAAACAGAAGAT AGAGGGAGT CT CGGAGCTC G C CATCT C CAG CGAT CT CT ACAT T G 

103 ' » 162 

AF 0 5 5 6 3 4 TTGCGCGTCTCTCTGTCCCCCTCCCCTCCCCCCGGCACACCTCTGTCTACGATGAGGAAA 
UNCSC_exonl TTGCGCGTCTCrCTCTCCCCCTCCCCTCCCCCOSGCACACCTCTG 

SEQ ID NO:l GGAAAAAACATGGAGTCAGCTCCGGCAGCCCCCGACCCCGCCGCCAGCGAGCCAGGCAGC 
SEQ ID NO: 3 GGAAAAAACATGGAGTCAGCTCCGGCAGCCCCCGACCCCGCCGCCAGCGAGCCAGGCAGC 
RORAl_exonl GGAAAAAACATGGAGTCAGCTCCGGCAGCCCCCGACCCCGCCGCCAGCGAGCCAGGCAGC 

163 222 
AF 0 5 5 6 3 4 GGTCTGCGGGCGAC!AGCGGCCCGCTGCX3GACTGGGACrGGGATACTTGCT 
UNC5C_exorLl GGTCTGCGGGCGACAGCGGCCCGCTGCGGACTGGGACTGGGAT 

SEQ ID NO:l AGCGGCGCGGACGCGGCCGCCGGCTCCAGGGAGACCCCGCTGAACCAGGAATCCGCCCGC 
SEQ ID NO: 3 AGCGGCGCGGACGCGGCCGCCGGCTCCAGGGAGACCCCGCTGAACCAGGAATCCGCCCGC 
RORAl_exonl AGCGGCGCGGACGCGGCCGCCGGCTCCAGGGAGACCCCGCTGAACCAGGAATCCGCCCGC 

223 282 
AF05 5634 GTGCTACCTGCCCTGGCCCTGCTCAGCGCCAGCGGCAC^ 

UNC5 C_exoill GTGCTACCTGCCCTGGCCCTGCTCAGCGCCAGCGGCACTGGCTCCGCCGCCCAAG 
UNC5C exon2 ATGAT 
SEQ ID NO:l AAGAGCGAGC CGCCTGCCCCGGTGCG CAGACAGAGCT ATTC CAGCACCAGCAGAGATGAT 

SEQ ID NO: 3 AAGAGCGAGCCGCCTGCCCCGGTGCGCAGACAGAGCTATTCCAGCACCAGCAGAGATGAT 
RORA1 exonl AAGAGCQAGCCGCCTGCCCCGGTGCGCAGACAGAGCTATTCCAGCACCAGCAGAG 

***** 

283 . 342 

AF 0 5 5 6 3 4 GACTTTTTTCATGAACTCCCAGAAACTTTTCC^ 

UNC5 C_exon2 GACTT T TTT CATGAACTC CCAGAAACTTTTCCTT CTG ATC CAC CTG AGC CTC TGC CACAT 

SEQ ID NO:l GACTTTTTOCATGAACTCCCAGAAACTTTTCCTTCTGATCC^ 
SEQ ID NO: 3 GACTTTTTTCATGAACTCCCAGAAACTTTTCCTTCT 

************************************************************ 

343 402 
AF0 5 S 6 3 4 TTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAGAA 

UNC 5 C_exon2 TTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAGAATAAGCCTCTGAACCTGTACTGT 
SEQ ID NO:l TTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAGAATAAGCCTGTCAACCTGTACTGT 
SEQ ID NO: 3 TTCCITATTGAGCCTGAAGAAGCTTATATTGTGAAGAATAAGC 

************************************************************ 

403 462 
AF0 5 S 6 3 4 AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCATCAGAAG 
UNC 5 C__exon2 AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCATCAGAAG 
SEQ ID NO: 1 AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCATCAGAAG 
SEQ ID NO: 3 AAAG C AAGC CCTGC CAC C CAG ATCT ATTT CAAGTG T AATAGTG AATGGGTT CAT CAG AAG 

************************************************************ 

463 522 
AF 0 5 5 6 3 4 GACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTCTCATTGTCCGGGAAGTGAGC 
UNC5C_exon2 GAC CA CATAGTAG ATG AAAG AGTAG ATGAAAC TT CCG 

UNC5C_exon3 . ' GTCTC AT TGT CCGGG AAGTG AG C 

SEQ ID NO:l GACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTCTCATTGTCCGGGAAGTGAGC 
SEQ ID NO: 3 GACCACATAGTAGATGAAAGAGTAGATX3AAACTTCCGGTCTCATTGTCCGGGAAGTGAGC 
************************************************************ 



523 582 
AF0 5 5 63 4 ATTG AG ATTTCG CG C CAG CAAGTGG AAGAACTCTTTGGAC CTG AAGATTACTGGTG C CAG 

UNC5C_exon3 ATTGAG ATTTCG CGC CAG C AAGTG GAAGAACTCTTT<^ AC CTGAAGATTACTGGTGC CAG 

SEQ ID NO:l ATTGAGATTTCGCGCCAGC^GTGGAAGAACTCTTTGGACCTGAAGATTACTGGTGCCAG 
SEQ ID NO: 3 ATTGAGATTTCX3CGCCAGCAAGTGGAAGAACTCTTTGGACCTC 

************************************************************ 
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UNC5C_exon3 
SEQ ID. NO:l 
SEQ ID NO: 3 
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583 642 
TGTGTGGCCTGGAGCTCCGCGGGTACCACAAAGAGCCGGAAC^CGTATGTGCGCATTGCA 
TGTGTGGCCTGGAGCTCCGCGGGTACCACAAAGAGCCGGAAGGCGTATGTGCGCATTO 
TGTGTGGC CTGGAG CT CCG CGGGT ACCACAAAGAG C CGG AAGG CGT ATGTG CG CATTG CA 
TGTGTGGCCTGGAG CTCCX3 CGGGTACCACAAAGAGCCGGAAGGCGTATGTGCGCATTGCA 
************************************************************ 



643 702 
AFO 55634 TAT CT ACGG AAG ACATTTG AGCAG GAAC C CCTAGG AAAGG AAGTGT CTTTGGAACAGG AA 

UNC5C_exon3 T 

UNC5C exon4 ATCTACGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTTGGAACAGGAA 
SEQ ID NO:l TATCTACGG AAG ACATTTG AGCAGGAACCCCTAGGAAAGGAAGTGTCTTTGGAACAGGAA 
SEQ ID NO: 3 TAT CT ACGGAAGACATTTG AG CAGGAAC C C CTAG GAAAGG AAGTG T CTTTGGAACAGGAA 

************************************************************ 



AF055634 
UNC5C_exon4 
UNC5C_exon5 
SEQ ID NO:l 
SEQ ID NO: 3 



703 762 
GTCTTACTTCCAGTGTC^ACC^CCrcAAGGGATCCC^GTGGCTGAGGTGGAATGGTTGAAA • 

GTCTTACTCCAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAG 

GTGGAATGGTTGAAA 
GTCTTACTCCIAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAGGTC 
GTCTTACTCCIAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAGGTGGAATGGTTGAAA 
************************************************************ 



763 822 
AFO 55634 AATGAAGAC!ATAATTGATCCCGTTGAAGATCGGAATTTTTA 

UNC5C exon5 AATGAAGAC!ATAATTGATCCCGTTGAAGATCGGAATTTTTATATTACTATTGATCACAAC 
SEQ ID NO:l AATG AAG ACAT AATTGATC C CGTT G AAG ATCGG AATTTTT AT ATT ACT ATTGAT C ACAAC 

SEQ ID NO: 3 AATGAAGACATAATTGATCCCGTTGAAGATCX3GAATTTTTATATT 

******************************************* ***************** 



823 



882 



AFO 5 5 6 3 4 CTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAATTACACCTGTGTTGCCA 
UNC5C exonS CTCATCATAAAGCAGGCCCGACTCTCTGATACJTGCAAATTACyVCCTGTGTTGCCAAAAAC 
SEQ ID NO:l CTCATCATA^GCAGGCCCGACTCTCTGATACTGCAAATTAC^CCTGTGTTGCCAAAAAC 
SEQ ID NO: 3 CTCATC^TAAAG^GGCCCGACTCTCTGATACTGCAAATTACACCTGTGTTGCCAAAAAC 
************************************************************ 



AF055634 
UNC5C_exon5 
UNC5C_exon6 
SEQ ID NO:l 
SEQ ID NO: 3 



883 



942 



ATTGTTGC CAAGAGGAAAAGTACAACTGC CACTGTCATAGTCTATGTCAACGGTGGCTGG 
ATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATG 

T CAACGGTG G CTG G 
ATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATGTCAAC^ 
ATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATGTCAACGGTGGCTGG 
************************************************************ 



AF055634 
UNC5C_exon6 
SEQ ID NO:l 
SEQ ID NO:3 



943 



1002 



TCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAAACGT 
TCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAAACGT 
TCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAAACGT 
TC CACCTGG ACGGAGTGGTCTGTGTGT AACAG C CGCTGTGGACGAGGGTATCAGAAACGT 
************************************************************ 



1003 



1062 



AF055634 ACAAGGACTTGTACCAACCCGGC^CCACTC^TGGGGGTGCCTTCTGTGAAGGGC 

UNC5C exon6 ACAAGX3ACTTGTACCAACCCGG CAC CACTCAATGGGGGTGCCTTCTGTGAAGGG CAGAGT 

SEQ ID NO:l ACAAGGACITGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTG^ 

SEQ ID NO: 3 ACAAGGACTTGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTGAAGGGCAGAGT 



AF05S634 
UNC5C_exon6 
UNC5C_exon7 
SEQ ID NO:l 
SEQ ID NO: 3 , 



i063 1122 
GTGCAGAAAATAGCCTGTACTACGTTATGCCCAGTGGATGGCAGGTGGACGCCATGGAGC 

jGTGCAGAAAATAGCCTGTACTACGTTATGCCCAG 

TGGATGGCAGGTGGACGCCATGGAGC 

gtgcagat^tagcctgtactacgttatgcccagtggatggcaggtggacgccatggagc 

GTGCAGAAAATAGCCTCTACTACXSTTATGCCC^GTGGATC^ 

************************************************************ 



AF055634 
UNC5C_exon7 
SEQ ID NO:l 
SEQ ID NO: 3 



1123 



1182 



AAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGCAGGAGGGAGTGCACGGCGCCA 
AAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGCAGGAGGGAGTGCACGGCGCCA 
AAGTGGTCTACTTGTGGAACTG AGTGCACCCACTGGCGCAGGAGG GAGTGCACGGCG CCA 
AAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGCAGGAGGGAGTGCACGGCGCCA 
************************************************************ 
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1183 



1242 



AF055634 
UNC5C_exon7 
SEQ ID NO:l 
SEQ ID NO: 3 



GCCCCCAAGAATGGA6GCAAGGACTGCGACGGCCTCGTCTTGCAATCCAAGAACTGCACT 
GCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTGCAATCCAAGAACTGCACT 
GCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTGCAATCCAAGAACTGCACT 
GC C C CCAAG AATGGAGG CAAGG ACTG CG ACGG C CT CGT CTTG CAATCCAAGAACTG CACT 
**************************************************'********** 



AF055634 * 
UNC5C_exon7 
Fusion spec exon 
SEQ ID NO:l 
SEQ ID NO: 3 



AF0S5634 

Fusion spec exon 
UNC5C_exon8 
SEQ ID NO:l 
SEQ ID NO: 3 



AF055634 
UNC5C__exon8 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exon8 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exon8 
UNC5cTexon9 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exon9. 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF05S634 
UNC5C_exori9 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exon9 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



1243 » M 
GATGGGCTTTGCATGCAGA 

GATGGGCTTTGCATG CAGA 

GTTTCATTTATCCCATTTCAACTGAACAGAGAACCCAGAAT 
GATGGGCTTTGCATGCAGAGTTTCATTTATCCCATTTCAACTGAACAGAGAACCCAGAAT 
G ATGGG CTTTGCATG CAGAGTTT CATT T AT C C CAT T T CAAC T G AACAGAGAAC C CAGAAT 
************************************************************ 



1303 



1362 



. . CTCCTCCTGATTCAGATGATGTTCCTCTC^ATCTTGGGATTGTG 

GAATATGGATTTTCTT „^ rTVtv ^ rT ,^ 

CTGCTCCTGATTCAGATGATGTTGCTCTCTATGTTGGGATTGTG 

GAATATGGATTTTCTTCTGCrCCrGATTCAGATGATGT^GCTCTCTATGTTGGGATTGTG 

GAATATGGATTTTCTTCTGCTCCTGATTCAGATGATGTTGCTCTCTATGTTGGGATTGTG 

********************* ***************** ************** 



********** * 



1363 



1422 

t tag C A GTGATCCTTTGCCTGGCGATCTCTGTAGTTGTGGCCTTGTTTGTGTATCGGAAG 
ATAGCMTGATaSTTTGCCTGGCGATCTCre 

ATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTGGCCTTGTTTGTGTATCGGAAG 

ATAGCAGTGATCGTTTGCCTGG03ATCTCTOTAGTTGTOGCC™ 

TTGTTTGTGTATCGGAAG 

************************************************************ 



1423 



1482 



AATCATCGTGACTITGAGTCAGATATTATTGACTCTTCGGCACTCAATGGGGGCTTTCAN 
AATCATCGTGACTTTGAGTCMATATTATTGACT 
AATCATCGTCACTTTGAGTCAGAT^^ 
AATC^TCGTGACrTTGAGTCAGATATTATTGACT 

AATCATCGTCACTTTGAGTCAGATATTATTGACTCTTCGGCACTCAATG 

*t**V****************************************************** 



1483 



1542 



CCTGTGAACATCAAGGCAGCAAGACAAGATCTGCTGGCTGTACCCCCAGACCTCACGTCA 

CCTGTG AACATCAAGGCAG C AAGACAAG 

ATCTGCTGGCTGTACCCCCAGACCTCACGTCA 

CCTGTGAACATCAAGGCAGCAAGACAAGATCTGCTGGCTGTACCCCCAGACCTCACGTCA 

CCTGTGAACATCAAGGCAGCAAGACAAGATCTGCTGGCTGTACCCCCAGACCTCACGTCA 

CCTGTG AACATCAAGGCAGCAAGACAAGATCTGCTGGCTGTACCCCCAGACCTCACGTCA 

************************************************************ 



1543 



1602 

GCTG CAGCCATGT ACAGAGGACCTGTCTATGCCCTGCATG ACGTCT CAG ACAAAATCCCA 
G CTG CAG C CATGT ACAGAG G AC CTG T CT ATG C C CTG C ATG ACGT CTCAGACAAAATC C CA 
GCTCCAGCCATGTACAGAGGACCTGTCTATCCCCTGCATGACGTCTCAGACAAAATCCCA 
GCTCCAGCCATCTACAGAGGACCTGTCTATGCCCrGCATGACGTCTCAGAC^AAAATCCCA 
G CTGCAG C CATGTACAGAGG ACCTGTCTATGCCCTG CATGACGTCTCAGACAAAATCCCA 
****^****************************************************** 



1603 



1662 



ATGACC^CTCTCCAATTCTGGATCCACTGCCCAACCTC^ 
ATCACCAACTCTCCAAT^CTGGATCCACTGCC 

ATG AC CAACT CTC CAATT CTGG ATC CAC TG C C CAAC CTG AAAAT CAAAGTGT ACAACAC C 
ATG AC C AACT CT C CAATT C TGG AT C CACTG C CCAAC C TG AAAAT CAAAGTGT AC AACAC C 
ATGACCAACTCTCCAATTCTGGATCCACTGCCCAACCTGAAAATCAAAGTGTACAACAC^ 

******t***************************************************** 



1663 



1722 



TCAGGTCCTGTCTCCCCCCAAGATGACCTCT 
TCAGGTGCTGTCACCCCCCAAGATGACCTCrrCrGAGT^rTACGTCCAAGCTC 
TCAGGTGCTOTCACCCCCCMGATGACCTCTCTGAGTTTACGTGC^GCTO 
TGAGGTGCTGTCACCCCCCAAGATGACCTCTCTCAGTTTACGTCCAAGCTGTC 
' TCAAGTGCTCTCACCCCCCAAGATGACCTCTra^ 
*** ******** ********************** 
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1723 1782 
AF055634 ATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCAGCCTGAAGAACCAGAGTCTAGCAAGG 
UNC5C_exoil9 m ATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCAGCCTGAAGAACCAGAGTCTAGCAAGG 
SEQ ID N0:1 ATGAC C CAGT CGTTGTTGG AGAATG AAGC CCTCAGCCTG AAGAAC CAGAGTCT AG CAAGG 

SEQ ID NO: 3 ATGA CC CAGTCG TTGTTGGAGAATGAAGC CCTCAGC CTGAAGAAC CAGAGTCT AG CAAGG 

AAC90914 ATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCAGCCTGAAGAACCAGAGTCTAGCAAGG 



1783 1842 
AFO 55634 GAGACTGATCCATCCTGTACC^CATTTGGCIAGC 

UNC5 C_exon9 CAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACTCGCTGGGAGGTC^ 

SEQ ID NO:l CAGACTGATCC^TCCTGTACCGC^TTTGGC^GCTTC^U^CTCACTGGGAG<3TC^CCTTATT 

SEQ ID NO: 3 CAGACTC5ATCCATCCITCTACCGCATTTGGC2AGCTTCAACTCA 

AAC 90914 CAGACTGATCCATCCTGTACCGC^TTTGGCAGeTTCAACTCGCrrGGGAGGTCACCTTATT 
***************************************** ****** **★****■**★★ * 

1843 1902 

AFO 5 5 6 3 4 G TTCC CAATT CAGG AGT CAG CTTG CTG ATTC CCG CTGGGG C CATTC C C CAAGGGAG AG TC 

UNC5C_exon9 GTTCCCAATTCAG 

UNC5C_exonl0 1 GAGTCAG CTTGCTGATTCCCG CTGGGGCCATTCCCCAAGGGAGAGTC 

SEQ ID NO:l GTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGGGGCCATTCCCCAAGGGAGAGTC 

SEQ ID NO: 3 GTTCCC^ATTC^GGAGTCIAGCT^GCTGAT/rCCCGCTGGGGCCATTCCCCAAGGGAGAGTC 

AAC 90914 GTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGGGGCCATTCCCCAAGGGAGAGTC 

1903 1962 
AFO 5 5 6 3 4 TACGAAATGTATGTGACTGTACACAGGAAAGAAACTATGAGGCCACCCATGGATGACTCT 
UNC 5 C_exonl 0 TACG AAATGT ATGTGACTGTACACAGG AAAGAAACTATG AG 

UNC5C_exonll GCCACCCATGGATGACTCT 
SEQ ID NO:l TAOTAAATGTATGTGACTCTACACAGGAAAGAAACTATCAGGCCACCCATGGATGACT.CT 
SEQ ID NO: 3 TACGAAATGTATGTGACTGTACACAGGAAAGAAACTATGAGGCCACCCATGGATGACTCT 
AAC 9 0914 TACGAAATGTATGTGACTGTACAC^GGAAAGAAACTATGAGGCCACCaVTGGATGACTCT 
*********************************************** ************* 



AF055634 
UNC5C_exonll 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



1963 ' 2022 

C^GACACITTTGACCCCTGTGGT<3AGCTGTGGGCCCCC!AGGAGCTOT 

CAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCCCAGGAGCTCTGCTCACCCGCCCA 

CAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCCCAGGAGCTCTGCTCACCCGCC 

CAGACACTTTTOACCCCTGTGGTGAGCTGTGGGCCCCCAGGAGCTCTGCTCACCCGCCCC 

CAGACACTTTTGACCCCTGTGGTGAGCTGTGGG CCCCCAGGAGCTCTGCTCACCCG CCCC 



AF0S5634 
UNC5C_exonll 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2023 



2082 



GTCGTCCTCACTATGCATCACTGCGCAGACCCCAATACCGAGGACTGGAAAATACTGCTC 
GTCGTCCTCACTATGCATCACTGCX5GAGACCCGAAT 

GTCGTCCTCACTATGCATCACTGCGCAGACCCCAATACCGAGGACTGGAAAATACTGCTC 
GTCGTCCTCACTATGCIATCACTG CGCAG ACC CCAAT ACCGAGGACTGGAAAAT ACXGCT C 
************************************************************ 



AF055634 
UNC5C_exonll 
UNC5C_exonl2 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2083 2142 
AAGAACC1AGGCAGCACAGGGACAGTGGGAGGATGTGGTGGTGGTCGGGGAGGAAAACTTC 
AAGAACC AGG CAG CACAGGG ACAGTGGGAG 

GATGTGGTGGTGGTCGGGGAGGAAAACTTC 
AAGAACCAGGCAGCACAGGGACAGTGGGAGGATGTGGTGGTGGTCGGGGAGGAAAACTTC 



AF055634 
UNC5C_exonl2 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2143 2202 
ACCACCCCCTGCTACATT AAG CTGGATG CAGAGGC CTGCCACATCCTCACAGAGAACCTC 
ACCACCCCCTG CT ACATTCAGCTGGATGCAGAGGCCTG C CACATCCTCACAGAGAACCT C 
ACCACCCCOTGCTACATTCAGCTGGATGCAGAGGCCTCCCACATCCTCACAGAGAACCTC 



AF055634 
UNC5C_exonl2 
SEQ ID NO:l- 
SEQ ID NO:3 
AAC90914 



2203 2262 
AG CACCTACGCCCTGGTAGGACATTCCACCACCAAAG CGGCTG CAAAGCGCCTCAAGCTG 
AG CACCTACGCCCTGGTAGGACATTCCACCACCAAAGCGGCTG CGAAGCG CCTCAAGCTG 
AG CACC TACG CCCTGGTAGGACATTCCACCACCAAAG CGGCTG CAAAGCGCCTCAAGCTG 
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AF055634 
UNC5C_exonl2 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2263 2322 
GCCATCTTTGGGCCCeTGTGCTGCTCCTCGCTGGAGTACAGCATCCGAGTCTACTGTCTG 
GCCATCTTTGGGCCCCTGTGCTGCTCCTCGCTGGAGTAG^ 

GCCATCTTTGGGCCCCTGTGCTGCTCCTCGCTGGAGTACAGCATCCGAGTCTACTGTCTG 

' CTCGCT GG AGT ACAG CATC CGAGT CT ACTGT C TG 

- - CTCGCTG GAGTACAGCATCCGAGTCTACTGTCTG 



AF05S634 
UNC5C_exonl2 
UNC5C_exonl3 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2323 2382 
GATGACACCC^GGATGCCCTGAAGGAAATTTTACATCTTGAGAGACAGACGGGAGGACAG 
GATGACACCCAGGATGCCCTGAAG 

GAAATTTTACATCTTGAGAGACAGACGGGAGGACAG 
GATGAC ACC CAGG ATGC CCTGAAGG AAATTTTACAT CTTGAGAGACAGACGGGAGGACAG 
GATGACACCCAGGATGCCCTGAAGGAAATTTTACATCTTGAGAGACAGACGGGAGGACAG 
GATGACACCCAGGATGCCCTGAAGGAAATTTTACATCTTGAGAGACAGACGGGAGGACAG 
************************************************************ 



AF05S634 
UNC5C exonl3 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2383 2442 
CTCCTAGAAGAAC CTAAGGCT CTT CATTTT AAAGG CAGCAC C CACAAC CTG CG C CTG TCA 
CT C CTAG AAGAAC CT AAGG CTCTT CATTTT AAAGG C AGCAC C CAC AACCTG CGC CTGT C A 
CTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGG GAG CACCCACAACCTGCGCCTGTCA 
CTCCTAGAAGAAC CTAAGGCTCTTCATTTTAAAGGC AGCACCCACAACCTG C^CCTGTCA 
CTC CTAG AAGAACCTAAGGCTCTTCATTTT AAAGG CAGCACCCACAACCTGCGCCTGTCA 



2443 2502 
AF05 S 63 4 ATT CACG AT ATCG CC CATT C C CT CTGGAAG AG C AAATTG CTGG CT AAAT ATCAGG AAATT 

UNCS C_exonl3 ATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGCTAAATATCAG 
UNC 5 C_exonl 4 G AAATT 

SEQ ID NO:l ATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGCTAAATATCAGGA 
SEQ ID NO: 3 ATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGCTAAATATCAGGAAATT 
AAC9 0914 ATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGCTAAATATCAGGAAATT 

************************************************************ 

2503 2562 
AF055634 CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTGCACTGC^ 

UNC5 C_exonl4 CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTGCACTGCACCTTCACTCTGGAA 
SEQ ID NO:l CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTGCACTGCACCTTCACTCTGGAA 
SEQ ID NO: 3. CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTG 
AAC9 0914 CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTGCACTGCACCTTC^ 

************************************************************. 



AP055634 
UNC5C_exonl4 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2563 2622 

AGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACTCTGTGTGCGGCAGGTGGAAGGA 

AG ATTT AG CCTGAACACAGTGGAGCTGGTTTG CAAACTCTGTGTG CGGCAGGTGGAAGGA 

AGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACTCTGTGTGCGGCAGGTGGAAGGA 

AGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACTCTGTGTGCGGCAGGTGGAAGGA 

AGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACrrCTGTGTGCC^ 

*********************************** ************************* 



AF055634 
UNC5C_exonl4 
UNC5C_exonl5 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2623 2682 
GAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGAGGAACCTACTGGCATCGATTTG 
GAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGAG 

GAACCTACTGGCATCGATTTG 
GAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGAGGAACCn'ACTGGC^TCGATTTG 
GAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGAGGAACCTACTGGCATCGATTTG 

GAAGGG CAGATCTTCCAGCTCAACTGCACCGTGTCAGAGGAACCTACTGG CATCGATTTG 
************************************************************ 



2683 2742 
AF055634 CCG CTG CTGGATCCTGCGAACAC CAT CAC CACGGTCACGGGGCCCAGTGCTTTCAG CATC 

DNC5 C_exonl 5 CCGCTGCTGGATCCTGCGAACACCATCACCACGGTCACGGGGCCCAGTGCTTTCAGC^ 
SEQ ID NO:l CCGCTGCTGGATCCTGCGAACACCATCACCACGGTCACGGGGCCCAGTGCTTTCAGCATC 
SEQ ID NO: 3 CCGCTGCTGGATCCTGCGAACACCATCACCACGGTCACGGGGCCCAGTGCTTTCAGCATC 
AAC9 0914 CCGCTGCTGGATCCTGCGAACACCATCACCACGGTCACGGGGCCCAGTGCTTTCAGC^T^ 



2743 2802 
AF 0 5 5 6 3 4 CCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGATGCCCCCCAGACGAGAGGCCAT 
UNCS C_exonl 5 CCTCTC CCTATCCG GCAGAAGCTCTGT AGCAGCCTGGATGCC CCCCAGACGAGAGGCCAT 

SEQ ID NO:l CCTCTCCCTATCCGGCAGAAGCTCTGTAGC^GCCTCGATGCCCCCCAGACGAGAGGCCAT 
SEQ ID NO: 3 CCTCTCCCTATCCGGCAGAAG CTCTGTAG CAGCCTGGATGCCCCCCAGACGAGAGGCCAT 

AAC 90914 CCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGATGCCCCCCAGACGAGAGGCCAT 

********************************************+***************• 
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AF055634 
UNC5C_exonlS 
UNC5C_exonl6 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AP055634 
UNC5C_exonl6 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exonl6 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 • 



AF055634 
UNC5C_exonl6 
SEQ* ID NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C exonlS 



AF055634 
UNC5C exonlS 
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2803 '2862 
GACTGGAGGATG CTGG C CCATAAGCTGAAC CTGG ACAGGTACTTGAATTACTTTGC CAC C 
GACTGGAGGATGCTGG CCCATAAG CTG AACCTGGACAG 

GTACTTGAATTACTTTGCCACC 
GACTGGAGGATG CTGGC C CATAAG CTG AAC CTGG ACAGGTACTTGAATTACTTTGC CAC C 
GACTGGAGGATG CTGG C C CATAAGCTGAAC CTG G ACAGGTAC!TTGAATTACTTTG C CAC C 
GACTGGAGGATG CTGGCCCATAAGCTGAACCTGGAC!AGGTACTTC 

********.**************************************************** 

2863 2922 

AAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATGGA 

AAATCCAGCCCMCTGGCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATGGA 

AAATCCAGCCCAACTGGCGTAATCC!TGGATCTTTGGGAAGCA(ZAGAA 

AAATCCAGCCCA^CTGGCGTAATCCTGGATCITTTGGGAAGC!ACAGAACTTCCCAGATGGA 

AAATCCAGCCCAACTGGC!GTAATCCTGGATC?TTTGGGAAGCAC!AGAACT 

********** ************************************ ************** 

2923 2982 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

AACCTGAGC'ATGCH'GGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACG 

AACCTGAGCATGCTGGC!AGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

2983 3003 3028 3042 

TTAGCAGCAGAAGGGCAGTATTAACCACGATGCTGGAAGGGGAAATGCAGTATTAACCAC 
TTAGCAGCJAGAAGGGCAGTATTAACCACCATGCTGGAAGGGGAAATGCAGTATTAACCA.C 

TTAGCAGCAGAAGGGCAGTATTAACCACCATGCTGGAAGGGGAAAT 

TT AGCAGCAGAAGGGC AGT ATT AAC CAC C ATG CTGG AAGGG G AAAT 

TT AG CAG CAG AAG GGC AGT AT 

************************************************************ 

3043 . 3102 

C^TGCTGGAAGGGGAAATGAAGGACAAAAATGCACAGGGAGTCTGTGGCCGTCCAGGTGA 
CATGCTGGAAGGGGAAATGAAGGACAAAAATGCACIAGGGAGTCTGTGGCCGTCCAGGTGA 
************************************************************ 

3103 3162 
ATCACAGCTGAGGAGGAAATCCAGATGAGACCAATGCACTTCACAGGCAAGAATGCAGCA 
ATCACAG CTG AGGAGGAAATCCAGATGAGACCAATG CACTTCACAGG CAAGACTGCAGCA 
**************************************************** ******* 



3163 3222 
AF 0 5 5 6 3 4 GGAGCCAGAAGGAAAAC^GATACAACTGCCCATGTACATGCCCACTTTACTCGGAGATCA 
UNC5C_exonl6 GGAGCCAGAAGGAAAACAGATACAACrGCCCATGTACATGCCCACTTTACTCGGACATCA 

******************************************************* **** 



AF055634 
UNC5C exonl6 



3223 3282 
TCACGGGAGTTAAGAAAAATTGTGTAAATTTGTACCTTGAATTTAGCTATCAACCTAATT 
TCACGGGAGTTAAGAAAAATTGTGTAAATTTGTACCTTGAATTTAGCTATCAACCTAATT 
************************************************************ 



3283 3342 
AF 0 5 5 6 3 4 TTCCTCTTAGTTGGGCTGTATGCTGTGTGGTACAGGATCTTACAGTTTCCTAGGAAACGC 
UNC5 C_exonl 6 TTCCTCTTAGTTGGGCTGTATGCTGTGTGGTACAGGATCTTACAGTTTCCTAGGAAACGC 

************************************************************ 

3343 3402 
AF055634 TTTTTATTG CT ATCCAGATATATG G AT AAACTTT CTTAACAAAC CCAATTT CT ACAAATG 

UNC5C_exonl6 TTTTTATTGCTATCCAGATATATGGATAAACTTTCTTAAC^ 

************************************************************ 

3403 3462 
AF0 55634 TTGTTTACATCAAATTGGACAGGGATGCAGACACTGTCC^TGGCTCGTTCT 
UNC5 C_exonl 6 TTGTTTACATCAAATTGGACAGGGATGCAGACACTGTCCATGGCTCGTTCTAT^ 

************************************************************ 



3463 3522 
AF055634 CAAATCATTTG AAGTTGAAG CTGTGGACIGGTTTGTTGTGT CTATTT CAG ATTAGT AATTT 

UNC5 C_exonl 6 CAAATCATTTG AAGTTGAAGCTGTGGACGGTTTGTTGTGTCTATTTCAGATTAGTAATTT 

************************************************************ 
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AF055634 
UNC5C exonl6 



3523 3582 
ACAGAGAAATCACAGACTTTTGCTACAAATCGTGTGCAT^ 

ACAGAGAAATCACAGACTTTTGCTACAAATCGTGTGC^TCAAGTGTCTC^GATAATCCTC 
************************************************************ 



AF055634 
UNC5C exonl6 



3583 3642 
CCATCAGTGTTCTGTTTCTAGAACTTGTAGAACCAGTGTTACTGTTTGTATCAGGGAAGT 
CCATCAGTGTTCTGTTTCTAGAACTTGTAGAACCAGTGTTACTGTTTGTATCAGGGAAGT 



AF055634 
UNC5C exoxil6 



3643 3702 
GGAGAATCTAAGTGTAAAAAAGAAATAACTAAGACTCCTATTCCTTGGAGGGACCCTTCT 
GGAGAATCTAAGTGTAAAAAAGAAATAACTAAGACTCCTATTCCTTGGAGGGACCCTTCT 
*************** ************************* ******** ************ 



AF055634 
UNC5C exonl6 



3703 3732 
GGTGCCCTTTGGGAATAAAGCTGTAGCACTGC 
GGTG C C CTTTGGG AATAAAG CTGTAG CACTG C 
******************************** 
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DNC5C exon 1 
UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 
RORal exon 1 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO* 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 
Fusion specific exon 
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SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 
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1 59 

MRKGLRATAARCGIXSLiGYLLQMLVLPALALLSASGTGSAAQ 

MRKG LRATAARCGI^LG YLLQMLVLP ALALLS ASGTGS AAQDDDF 

MESAPAAPDPAASBPGSSGADAAAGSRETPLNQESARKSEPPAPVRRQSYSSTSRDDDF 
MESAPAAPDPAASEPGSSGADAAAGSRETPLNQESARKSEPPAPVRRQSYSSTSRDDDF 
MESAPAAPDPAASEPGSSGADAAAGSRETPLNQESARKSEPPAPVRRQSYSSTSR 

** ** 

60 119 
FHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNIiYCKASPATQIYFKCNSEWVHQKDH 
FHELPETFPSDPPEPIiPHFLIEPEEAYIVKNKPVNIjYCKASPATQIYFKCNSEWVHQKDH 
FHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQIYFKCNSEWVHQKDH 
************************************************************ 

120 179 
I VDERVDETSGLI VREVS IEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYL 
I VDERVDETSGLI VREVS IEI S RQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRI AYL 
I VDERVDETSGLI VREVS IEISRQQVEELFGPEDYWCQCVAWS SAGTTKSRKAYVRIAYL 
************************************************************* 

180 239 
RKTFEQEPLGKEVSLEQEVLLQCRPPEGI PVAEVEWLKNED I IDPVEDRNFYITIDHNIjI 
RKTFEQEPLGKEVSLEQEVIJ^CRPPEGIPVAEVEWIjKNEDIIDPVEDRNFYITIDHNLI 
RKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPVEDRNFYITIDHNLI 
************************************************************ 

240 299 
IKQARLSDTANYTCVAKNIVAKRKSTTATVIVYWGGWSTWTEWSVCNSRCGRGYQKRTR 
I KQARLSDTANYTCVAKNI VAKRKSTTATVI VYVNGGWSTWTEWSVCNS RCGRGYQKRTR 
IKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGRGYQKRTR 
*************************************************** ********* 

# 

300 359 

TCTNPAPI^GGAFCEGQSVQKIACTTLCPVDGRWTPWSKWSTCGTECTHWRRRE^ 

TCTNPAPLNGGAFCEGQSVQKIACTTIiCPVDGRWTPWSiOTSTCGTECTHWRRRECT 

TCTWPAPLNGGAFCEGQSVQKiACTTLCPVDGRWTPWSKWSTCGTECTHWRRRECTAPAP 

************************************************************ 

360 419 

KNGGKDCDGLVLQS KNCTDGIiCMQ TAPDSDDVAIiYVGI VIA 

KNGGKDCDGIiVLQS KNCTDGLCMQS FI YPI5TEQRTQNEYGFS S APDSDDVALYVGI VI A 
KNGGKDCDGLVLQSKNCTDGLCMQS FI YP-I STEQRTQNEYGFS S APDSDDVALYVGI VIA 

SFIYPI STEQRTQNEYGFS 
************************ **************** 

420 479 
VI VCLAI S WVALFVYRKNHRDFESDI IDS SALNGGFQPVNIKAARQDLLAVPPDLTSAA 
VI VCLAI SVWAIiFVYRKNHRDFESD I IDS SALNGG FQPVNI KAARQDLLAVP PDLTSAA 
VIVCLAISVWALFVYRKNHRDFESDI IDS SALNGGFQPVNIKAARQDLLAVPPDLTSAA 
************************************************************ 

480 539 
AMYRG P VYALHDVS DKI PMTNS P I LD PLPNIiKIKVYNTSGAVS PQDDLSE FTS KLS PQMT 
AMYRGP VYALHDVSDKI PMTNS PILD PLPNLKI KVYNTSGAVTPQDDLSEFTSKLS PQMT 
AMYRG PVYALHDVSDKI PMTNS PI LDPLPNLKI KVYNTSGAVTPQDDLSEFTSKLS PQMT 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



540 599 
QSLLENEALSLKNQSLARQTDPSCTAFGSFNSLGGHLIVPNSGVSUjI PAGAI PQGRVYE 
QSLLENEALSLKNQSLARQTDPSCTAFGSFNSLGGHLIVPNSGVSLLI PAGAI PQGRVYE 
QSLLENEALSLKNQSLARQTDPSCTAFGS FNSLGGHLXVPNSGVSLLI PAGAI PQGRVYE 
************************************************************ 

600 ^ 659 

MYVTVHRKETMRP PMDD S QTLLTP WS CG ALLTRP VVLTMHHCAD PNTEDWKILLKN 
MYVTVHRKETMRPPMDD S QTLLT P WS QGP PpALLT RPWLTMHHCADPNTEDWKI LLKN 
MYVTVHRKETMRPPMDD SQTLLTP WSGGP^^ PNTEDWKI LLLA 

************************** *»&** ************************** * 
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SEQ ID NO: 2 
SEQ ID NO: 4 
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660 719 
QAAQGQWEDVVVVGEENFTT PCYIKLDAEACHI LTENLSTYAIjVGH STTKAAAKRLKLAI 
QAAQGQWEDVVWGEENFTTPCYIQLDAEACHILTENLSTYAIjVGHSTT^ 
GVQHPSLLSG 



UNCSC 

SEQ ID NO: 2 



720 779 
FGPLCCSSLEYS IRVYCLDDTQDALKEI LHLERQT^ 

FGPLCCSSLEYSIRVYCIiDDTQDAXjKEILHIjERQTGGQLLEEPKMjHFKGSTHNIjRLSIH 



UNC5C 

SEQ ID NO: 2 



780 839 
DIAHSLWKSKLIiAKYQEI PFYHVWSG SQRNLHCTFTIiERF SLNTVELVCKLCVRQVEGEG 
DIAHSLWKSKLIJ^QEIPFYHWSGSQRNLHCTFTLERFSLNTVELVCKLCVRQVEGEG 



UNCSC 

SEQ ID NO: 2 



840 B99 
QI FQLNCTVS EE PTG I DLPIiLD PANTITTVTG P S AFS I PL P IRQKIiCS S LDAPQTRGHDW 
QI FQIiNCTVSEEPIGIDLPLLDPANTITTVTGPS AFS I PLPI RQKLCS S LDAPQTRGHDW 



UNC5C 

ID NO: 2 



900 959 
RMLAHKLNLDRYLNYFATKS S PTGVI LDLWEAQNFPDGNLSMLAAVLEEMGRHET WSLA 
RMLAHKLNLDRYLNYFATKS S PTGVI LDLWEAQNF PDGNLS MLAAVLEEMGRHETWS LA 



UNC5C 

SEQ ID NO: 2 



960 

AEGQY 

AEGQY 
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